(54) Novel cDNAs encoding catenin-binding proteins with function in signalling and/or gene regulation



(57) The invention relates to the field of drug discovery, diagnosis, prognosis and treatment of cancer and
neurological disorders. The invention provides, among others, access to and insight into protein-protein or
protein-DNA interactions in a signal transduction or transcriptional pathway controlling cell growth or
development throughout a wide range of cells and tissues of the body, and provides means, such as nucleic acid,
protein, cells and experimental animals, and methods to identify candidate drugs, for example for use in therapy
of cancer or neurological disorders. As an example of a catenin-binding protein with function in intracellular
signalling or gene regulation, the invention provides an isolated and/or recombinant nucleic acid or a functional
fragment, homologue or derivative thereof, corresponding to a zinc finger gene with a nucleic acid sequence as
shown in figure 1 and encoding a zinc finger protein, or fragment thereof, capable of complexing with a neurally
expressed catenin.
[0001] The invention relates to the field of drug discovery, diagnosis and treatment of cancer and neurological
disorders.

[0002] Despite extensive knowledge relating to the multitude of cancer forms (varying in appearance from solid
tumours and related metastases in distinct parts of the body to leukaemias of blood cells that circulate throughout
the body, and varying from being totally benign to being aggressively malignant), effective therapy of cancer is
difficult and in general restricted to three types: treatment via radiation, via chemotherapy and via surgery.
Possibilities for a more specific therapy, directed against the underlying cause of a specific cancer or group of
cancers, are currently virtually non-existent. Extensive efforts are directed at providing such specific drugs
through drug discovery attempts that try to identify candidate drugs for specific therapy of cancer or neurological
disorders.

[0003] Development of cancer often starts with changes in a cell that lead to the unrestricted development and
division of that first cell into an ever dividing population of cells. These changes are often an accumulation of
mutations or other alterations in key genes that occur chronologically, whereby the mutated cell population loses
its original, often specialised character and acquires more and more of a cancerous nature. Normal processes of
growth regulation of cells are dysfunctional in the altered cells. Transcription of genes that are normally
expressed only at low levels in said cell type is possibly no longer controlled in cancerous cells.

[0004] Activation of transcription of genes by transcription factors that would otherwise be dormant in the
specific cell type can for example lead to the unrestricted growth and neoplastic nature typical of cancer. Examples
are mutations in suppressor genes that function normally by generating proteins that suppress transcriptional
pathways which are no longer of use in a specialised cell. Mutated suppressor genes no longer help keep the
growth of a cell under control. Drugs directed against or intervening with the specific protein-protein or
protein-DNA interactions in transcriptional and/or signalling pathways controlling cell growth, cell differentiation
or development can be considered typical candidate drugs for later use in specific therapy of cancer or neurological
disorders, especially when such pathways have gone awry and lead to unrestricted growth or aberrant differentiation
of cells. In the case of αN-catenin, which is mainly neurally expressed, a broader or additive interpretation in
relation to neural dysfunctions is obvious.

[0005] The cadherin superfamily represents several cadherins with function in cell-cell adhesion, morphogenesis
and tissue homeostasis (Takeichi, 1991; Kemler, 1992; Suzuki, 1996). The transmembrane glycoprotein E-cadherin is
the best-studied prototype of this family and has been identified as a potent suppressor of invasion (Behrens et al.,
1989; Frixen et al., 1991; Vleminckx et al., 1991). Recent studies revealed proof for a tumor suppressor role of
human E-cadherin as the encoding gene behaves according to the two-hit model of Knudson (1985) in infiltrative
lobular cancers (Berx et al., 1995 and 1996) and diffuse gastric cancers (Becker et al., 1994 and 1996).

[0006] Cadherins function as cell-cell adhesion molecules by homophilic interactions with other cadherin
molecules, but linkage to the actin cytoskeleton is also essential. The latter is achieved by the catenins (catena
means chain) (Ozawa et al., 1990; Cowin, 1994), which comprise the Armadillo proteins (e.g. β-catenin, plakoglobin
and p120ctn) and the vinculin-like α-catenins. The Armadillo catenins are proteins known to be associated with
the cytoplasmic domain of cadherins. In turn, the α-catenins link β-catenin and plakoglobin to the actin cytoskeleton.

[0007] These catenins were also found to be associated with the cytoplasmic tumor suppressor gene product APC
(adenomatous polyposis coli) (Peifer, 1993; Su et al., 1993). A typical example of a signal transduction pathway
(via β-catenin) gone wrong and leading to development of cancer can be found with APC. The APC protein is linked to
the microtubular cytoskeleton. Moreover, in the desmosomes, plakoglobin mediates a link between desmosomal
cadherins and the cytokeratin cytoskeleton via desmoplakin (Korman et al., 1989; Kowalczyk et al., 1997).

[0008] For α-catenin, two subtypes have been identified, αE-catenin (epithelial form) (Nagafuchi et al., 1991;
Herrenknecht et al., 1991) and αN-catenin (neural form) (Hirano et al., 1992). Moreover, for both subtypes two
isoforms, resulting from alternative splice events, were identified (Oda et al., 1993; Uchida et al., 1994; Rimm et
al., 1994). A tissue-specific distribution for either of the subtypes has been reported. The epithelial αE-catenin is
expressed in a wide variety of tissues, but only low levels of expression were observed in the central nervous
system (CNS) (Nagafuchi et al., 1991). In contrast, αN-catenin expression is more restricted to particular tissues
including the nervous system, in which it is generally expressed (Hirano et al., 1992; Uchida et al., 1994).

[0009] One of the most intriguing discoveries in this field is the recently described association of LEF-1
(lymphocyte enhancer-binding factor-1), an architectural transcription factor (Love et al., 1995), with β-catenin
(Behrens et al., 1996). The interaction between β-catenin and LEF-1 leads to nuclear translocation of these two
proteins, implicating a central role for β-catenin in the transcriptional regulation of target genes, which can lead
to tumorigenicity (Huber et al., 1996; Peifer, 1997). Among the target genes induced by the β-catenin/LEF-1 complex
are the myc proto-oncogene (He et al., 1998) and the cyclin D1 gene (Tetsu and McCormick, 1999).

[0010] Cadherin-catenin-cytoskeleton complexes are key elements of cell-cell adhesion and regulation of motility,
but nuclear signalling by catenins is attracting growing interest and may be critical in tumorigenicity, invasion
and metastasis. However, despite all the knowledge of the cadherins and catenins, it was not known what proteins
are capable of translocating catenins to the nucleus, and how catenins might exert their effect on intracellular
signalling and on the transcription of genes in the cell. Only with the means and methods of the current invention
has a key step become apparent. Moreover, through the identification of a key step in the translocation of catenins
to the nucleus, it has now become possible to develop means and methods interfering with said process in, for
instance, tumorigenicity, invasion and metastasis of cells.

[0011] The invention therefore provides, among others, access to and insight into protein-protein or protein-DNA
interactions in a transcriptional pathway controlling cell growth or development throughout a wide range of cells
and tissues of the body, and provides means, such as nucleic acid, protein, cells and experimental animals, and
methods to identify candidate drugs, for example for use in therapy of cancer and/or neurological disorders. Where
in this application the definition "nucleic acid" is used, both RNA and DNA, in single or double-stranded fashion,
and nucleic acid hybridising thereto is meant. "Homologue" herein meaning a related nucleic acid that can be found
with another species. "Derivative" herein meaning a nucleic acid that has been derived by genetic modifications,
such as deletions, insertions, and mutations from a distinct nucleic acid or fragment thereof. "Corresponding"
herein meaning having a nucleic acid sequence homology of at least 80%, more preferably of at least 90%. The
sequence similarities obtained by the BLAST algorithm (Altschul et al., 1990) are given by P-scores (the more
negative, the higher the similarity), not by percentages. Nevertheless, nucleotide sequence homology can be
expressed as percentages (numbers of identical nucleotides per 100 nucleotides).
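
The percentage definition given above (identical nucleotides per 100 nucleotides) can be illustrated with a minimal Python sketch. It assumes the two sequences have already been aligned to equal length, with '-' as the gap character. The function names `percent_identity` and `corresponds` are hypothetical and not part of the specification, and counting gap columns as mismatches is only one of several possible conventions.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percentage of alignment columns holding the same nucleotide in both sequences.

    Assumes both inputs are pre-aligned to the same length; '-' marks a gap.
    Columns containing a gap count as mismatches (an illustrative convention).
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("sequences must not be empty")
    matches = sum(
        1
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if a == b and a != "-"
    )
    return 100.0 * matches / len(aligned_a)


def corresponds(aligned_a: str, aligned_b: str, threshold: float = 80.0) -> bool:
    """True if identity reaches the threshold (80% by default, as in the text)."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

Passing `threshold=90.0` applies the stricter "more preferably" criterion.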

The invention provides an isolated and/or recombinant nucleic acid or a functional fragment, homologue or derivative
thereof, corresponding to a catenin-binding protein with function in signal transduction or gene regulatory pathways,
more specifically an isolated and/or recombinant nucleic acid or a functional fragment, homologue or derivative
thereof, corresponding to for example a zinc finger gene with a nucleic acid sequence as shown in figure 1 and
encoding a zinc finger protein, or fragment thereof, said protein capable of complexing or interacting with catenin
or fragments thereof. "Functional fragment" herein meaning a nucleic acid or part thereof that is functionally or
structurally related to or hybridising with a distinct nucleic acid or fragment thereof. Typical examples of such a
functional fragment as provided by the invention are DNA binding elements and/or subcellular localisation signals.
For example, further characterisation of nucleic acid according to the invention revealed the presence of nucleic
acid fragments encoding Cys2His2 zinc fingers with DNA binding properties. In addition, in yet
another functional fragment, a nuclear localisation signal (NLS, such as PKKRKRK) is found.
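
As a hedged illustration of how such functional fragments might be located in a translated sequence, the sketch below scans a protein string for the NLS heptapeptide quoted above and for Cys2His2-like spacing. The regular expression encodes a generic textbook consensus, C-x(2,4)-C-x(12)-H-x(3,5)-H, which is an assumption and not a pattern given in this specification; the function names are hypothetical.

```python
import re

# NLS heptapeptide quoted in the text.
NLS_MOTIF = "PKKRKRK"

# Generic Cys2His2 zinc finger spacing, C-x(2,4)-C-x(12)-H-x(3,5)-H.
# This consensus is an assumption made for illustration only.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")


def find_nls(protein: str) -> list[int]:
    """Return 0-based start positions of the NLS heptapeptide."""
    return [m.start() for m in re.finditer(re.escape(NLS_MOTIF), protein)]


def find_c2h2_fingers(protein: str) -> list[tuple[int, int]]:
    """Return (start, end) spans of non-overlapping C2H2-like matches."""
    return [m.span() for m in C2H2_PATTERN.finditer(protein)]
```

A real analysis would normally rely on a curated motif database rather than a single regular expression; the sketch only shows the kind of positional scan involved.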

[0012] The invention also provides a nucleic acid according to the invention wherein said protein is capable of
nuclear translocation of αN-catenin but not αE-catenin. Co-expression of a zinc finger protein as provided by the
invention, or a functional fragment thereof, with particular catenins such as αN-catenin leads to a translocation of
this catenin into the nucleus. A zinc finger protein as provided by the invention can for example be isolated in
a two-hybrid screening, using human αN-catenin or another catenin as a bait, and is herein also called a
catenin-binding protein.

Up to now, there has been no report of zinc finger proteins binding to catenin or αN-catenin. Furthermore, the
invention provides a nucleic acid or derivative thereof corresponding to a nucleic acid or functional fragment thereof
encoding a zinc finger protein or other signalling or gene regulatory protein capable of interacting or binding with
a catenin, for example an Armadillo catenin or an α-catenin. For example, the invention provides a zinc finger
protein encoded by said nucleic acid interacting with catenin, for example with αN-catenin. The catenin protein,
forming a complex together with the zinc finger protein, is involved in signalling and gene regulation in for
example transcriptional pathways.

[0013] In a preferred embodiment, the invention provides nucleic acid according to the invention which is a cDNA
molecule, as for example described in the experimental part of this description.

[0014] Furthermore, the invention provides an expression vector comprising a nucleic acid according to the
invention. An example of such a vector can be found in the experimental part of the description. It is within the
skills of the artisan to provide vectors that have been provided with single or multiple nucleic acid changes,
deletions and/or insertions or other mutations of a nucleic acid sequence.

[0015] Furthermore, the invention provides a cell capable of expressing the zinc finger protein or functional
fragments thereof as provided by the invention. For example, the invention provides a cell comprising a genome in
which a nucleic acid sequence corresponding to a nucleic acid according to the invention has been modified. Such a
modification can comprise a (for example site-directed or transposon-directed or chemically induced) mutation of a
nucleic acid encoding (a fragment of) a gene encoding a catenin-binding zinc finger protein, be it in the intronic
or exonic sequences of said gene.

[0016] The invention also provides a cell comprising a nucleic acid according to the invention that has been
introduced via recombinant means known to the skilled artisan, for example via homologous recombination techniques
or by using a vector according to the invention. An example of such a cell according to the invention is provided in
the experimental part of the description wherein said cell is a yeast cell, such as for use in a two-hybrid
interaction assay.

[0017] The invention also provides a cell capable of expressing a catenin-binding protein or derivative or
fragment thereof, said protein comprising a catenin binding domain or capable of complexing with catenin. Such a
cell provided by the invention is for example derived from a cell comprising a genome in which a nucleic acid sequence
corresponding to a nucleic acid according to the invention has been modified, or derived from a cell comprising a
nucleic acid according to the invention. Expression of proteins in recombinant cells is in itself a technique
available to the average artisan. An example is provided in the experimental part wherein yeast cells expressing a
catenin-binding protein are provided.

[0018] Furthermore, the invention provides an animal comprising a cell according to the invention. Such an
animal is for example a transgenic animal obtained by modifying an embryonic stem cell. Stem cell modifications, as
well as modifications of other cells, are known to the average skilled artisan. Such an animal is for example a
homozygous knock-out animal or a heterozygous animal, for example a cross between a knock-out animal and a
wild-type animal, or is otherwise modified in a nucleic acid fragment in its genome that corresponds to a nucleic
acid provided by the invention. In a preferred embodiment of the invention, said animal is a mouse.

[0019] The invention also provides a zinc finger protein or derivative or fragment thereof, said protein capable
of complexing with catenin, preferably with αN-catenin. Preferably, such a protein is derived by recombinant
techniques and encoded by a nucleic acid according to the invention. Proteins, be it natural proteins or recombinant
versions thereof, can easily be isolated by a skilled artisan, for example when at least a part of the amino acid
sequence, or a specific antibody directed against the protein, is provided. The invention provides such an isolated
or recombinant catenin-binding zinc finger protein or (poly)peptide or derivatives or fragments thereof. In addition,
the invention provides an antibody specifically directed against such a protein or (poly)peptide or derivative or
fragment thereof according to the invention. It is within the skills of the average skilled artisan to provide a
(synthetic) antibody directed against a protein or fragment thereof, for example once the amino acid sequence or the
isolated protein is provided.

[0020] The invention also provides a method for identifying a candidate drug comprising use of a cell or an
animal or a protein or an antibody according to the invention. Candidate drugs, often first selected or generated
via combinatorial chemistry, can now be tested and identified using a method provided by the invention. Such a
candidate drug or compound can for example be tested on and selected for its effect on zinc finger protein
regulated nuclear translocation of catenin, as for example measured according to the experimental part of the
description. For example, a test compound or drug which inhibits the nuclear translocation of catenin is a candidate
drug for therapy of cancer or neurological disorders. The mRNA encoding the zinc finger protein is found in all
human tissues examined so far, with a low expression level in most of the tissues and a stronger mRNA signal in
pituitary gland and adrenal gland. Lung, placenta, fetal liver and fetal lung also showed expression of the mRNA,
but lower than the pituitary gland and the adrenal gland (see section Multiple tissue RNA dot blot). The partial
cDNA encoding the zinc finger protein was isolated from a human kidney cDNA library and the 5' RACE fragment was
obtained using mammary gland mRNA. On the other hand, αN-catenin is known to be mainly neurally expressed.
Furthermore, the invention provides a method for diagnosing cancer or neurological disorders comprising use of a
nucleic acid, a cell or an animal, or a protein or an antibody according to the invention. A test compound which
inhibits either the interaction of the zinc finger protein with αN-catenin or the interaction of the zinc finger
protein with DNA target sequences is also considered a candidate drug for therapy of cancer or neurological
disorders. Other applications include discrimination between different types of cancer or neurological disorders,
aiming at for instance improved diagnosis, prognosis and therapy.

[0021] The invention is further explained in the experimental part without limiting the invention. 



Experimental part



[0022] Here, we report the isolation, in a yeast two-hybrid screen, of a novel human zinc finger protein
associating with a human isoform of α-catenin, termed αN-catenin. The interaction we observed is αN-catenin specific,
since the epithelial αE-catenin does not interact with the zinc finger protein in the two-hybrid system. The cDNA
of the isolated Cys2His2-type zinc finger protein was completed using 5' RACE technology. In human cells transfected
with this cDNA, the full-length zinc finger protein localizes to the nucleus. In human cells transfected with
a cDNA of αN-catenin, this protein is expressed in the cytoplasm. When these two cDNAs are co-expressed, both
proteins colocalize in the nucleus. These results indicate the formation of a complex between the zinc finger
protein and αN-catenin and a subsequent translocation to the nucleus. To date, there has been no report of a nuclear
zinc finger protein binding to αN-catenin, nor of nuclear localization of the latter. The presence of the zinc finger
domain at the carboxy-terminus, consisting of 9 Cys2His2 zinc fingers with putative DNA binding properties, shows
that the isolated zinc finger protein might bind to specific DNA sequences. Taken together with the knowledge that
αN-catenin is expressed mainly in the neural system, it might play a role in the transcriptional regulation of target
genes, in particular in tumors and in the neural system.

MATERIALS AND METHODS 

A. Bacterial strains and cell lines

[0023] Escherichia coli DH5α (supE44, hsdR17, deoR, recA1, endA1, lacZΔM15) and E. coli HB101 (supE44,
mcrB, mrr, hsdS20, recA1) were used for transformations, plasmid propagation and isolation. The bacteria were grown
in LB medium supplemented with 100 µg/ml ampicillin. For selection of the two-hybrid cDNA-library plasmid,
transformed HB101 bacteria were grown on minimal M9 medium, supplemented with 50 µg/ml ampicillin, 40 µg/ml
proline, 1 mM thiamine-HCl and 1% of the appropriate amino acid drop-out solution. After selection the bacteria were
maintained in LB medium supplemented with 50 µg/ml ampicillin.

[0024] HEK293T, a human embryonic kidney cell line transfected with SV40 large T-antigen (SV40 tsA1609)
(Graham et al., 1977; DuBridge et al., 1987), was kindly provided by Dr. M. Hall (University of Birmingham, UK) and
was used for transient eukaryotic expression. HEK293T cells were grown at 37°C with 5% CO2 in Dulbecco minimal
essential medium (DMEM, Gibco BRL Life Technologies, Paisley, UK) supplemented with 10% FCS, 0.03% glutamine,
100 U/ml penicillin, 100 mg/l streptomycin and 0.4 mM sodium pyruvate.

[0025] GLC34 and GLC8 are cell lines derived from small cell lung carcinomas, established as described by de
Leij et al. (1985), and kindly made available by Dr. Charles Buys and Dr. Lou de Leij (University of Groningen,
the Netherlands). MKN45 is a gastric carcinoma cell line (Motoyama and Watanabe, 1983).

Plasmids and gene assembly

[0026] Restriction enzymes were purchased from Gibco BRL Life Technologies (Paisley, UK) or from New England
Biolabs (Beverly, MA, USA). Restriction enzymes were used according to the manufacturers' recommendations. All PCR
reactions were performed using Vent™ (Biolabs) DNA polymerase. The primers for PCR amplification were either made
in-house (University of Ghent) or obtained from Gibco BRL. The standard PCR mixture, in a reaction volume of 100 µl,
contained template cDNA (plasmid), 25 pmol of each specific primer, 200 µM dXTPs and the PCR buffer supplied with
Vent™ DNA polymerase. Unless otherwise stated, no additional MgSO4 was added. Vent™ DNA polymerase was used
at 1 U/reaction. The DNA amplification was performed in the PTC-200 Peltier Thermal Cycler PCR System (MJ
Research, Watertown, MA). The PCR program started with a DNA denaturation step at 94°C for 3 min, followed by 80°C
for 1 min. Cycling conditions were 94°C for 1 min, 50-60°C for 30 sec and 72°C for 2 min. This was repeated for a
total of 35 cycles and was followed by a final extension step at 72°C for 10 min.
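
As a worked check of the cycling program and primer amounts described above, the following minimal Python sketch encodes the steps as data and sums the programmed hold times. It assumes the reaction volume is 100 µl; the annealing temperature was stated only as a 50-60°C range, so the 55°C value is a placeholder, and all names (`Step`, `total_hold_minutes`, `primer_concentration_um`) are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    label: str
    temp_c: float  # block temperature in degrees Celsius
    seconds: int   # programmed hold time


# Hold times as stated in the text. The annealing temperature of 55 is a
# placeholder inside the stated 50-60 range, chosen for illustration only.
INITIAL = [Step("initial denaturation", 94, 180), Step("hold", 80, 60)]
CYCLE = [Step("denature", 94, 60), Step("anneal", 55, 30), Step("extend", 72, 120)]
FINAL = [Step("final extension", 72, 600)]
N_CYCLES = 35


def total_hold_minutes() -> float:
    """Sum of programmed hold times, ignoring ramping between temperatures."""
    seconds = (
        sum(s.seconds for s in INITIAL)
        + N_CYCLES * sum(s.seconds for s in CYCLE)
        + sum(s.seconds for s in FINAL)
    )
    return seconds / 60


def primer_concentration_um(pmol: float, volume_ul: float) -> float:
    """Convert an amount in pmol and a volume in µl to µM (1 pmol/µl equals 1 µM)."""
    return pmol / volume_ul
```

Under these assumptions the program holds for 136.5 minutes in total, and 25 pmol of primer in 100 µl corresponds to 0.25 µM.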

Construction of the plasmid encoding the aN-catenin GAL4-DBD hybrid protein for two-hybrid screening 

[0027] For the two-hybrid screen, the almost full-length cDNA of human αN-catenin (residues 4-906) was fused in
frame to the GAL4 DNA binding domain in the pAS2 vector (Matchmaker™, Clontech, Palo Alto, CA). This construct
was called pAS2ANCTN and was created using amplification by polymerase chain reaction (PCR) and restriction
fragments. For construction of this pAS2ANCTN two-hybrid bait plasmid, we used the expression plasmid
pPNhANCTN, which contains the human αN-catenin coding sequence flanked by part of the 5' and 3' UTR. The cDNA
for αN-catenin was kindly provided by Dr. C. Petit (Institut Pasteur, Paris; Claverie et al., 1993). Using a specific
sense primer completed with the XmaI restriction site (underlined) 5'-ACCC CCCGGG GGCAACTTCACCTATCATTC-3'
(= FVR137F; Table 4) and a compatible antisense primer 5'-GCCGCCGCCTTCCTTTTCATTTCCGCTCTT-3' (=
FVR138R; Table 4), we amplified a PCR fragment from the pPNhANCTN plasmid containing the XmaI restriction site at
the 5' end. This PCR fragment was digested with XmaI and BanI and ligated together with a BanI-HindIII fragment of
the αN-catenin cDNA into the XmaI-HindIII digested pAS2 vector. The in-frame cloning was confirmed by DNA sequence
analysis using the vector-specific forward primer 5'-ATCATCGGAAGAGAGTAGTA-3' (= FVR175F; Table 2). To check
for the insertion of the complete fragment, the constructed plasmid was also sequenced with a vector-specific
reverse primer 5'-AAAATCATAAATCATAAGAA-3' (= FVR217R; Table 2). The plasmid was assayed for expression of
the fusion protein in yeast using Western blot analysis with an antibody directed against the GAL4 DNA binding
domain (anti GAL4 DBD rabbit polyclonal antiserum, UBI, Lake Placid, NY).
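
Because the underlining that marked restriction sites in the primers is lost in plain text, a short check can confirm where a site lies. The sketch below is illustrative only: the recognition sequences are the standard ones for the named enzymes, while the function name `sites_in` and the choice of enzymes in the table are assumptions made for this example.

```python
# Standard recognition sequences (5'->3') of some enzymes named in the text.
SITES = {
    "XmaI": "CCCGGG",
    "EcoRI": "GAATTC",
    "NotI": "GCGGCCGC",
    "KpnI": "GGTACC",
    "SalI": "GTCGAC",
    "EagI": "CGGCCG",
}


def sites_in(primer: str) -> dict[str, list[int]]:
    """Map enzyme name to 0-based start positions of its site in the primer.

    Spaces are ignored so primers can be pasted with the site set off by spaces.
    """
    seq = primer.replace(" ", "").upper()
    found: dict[str, list[int]] = {}
    for enzyme, site in SITES.items():
        positions = [
            i for i in range(len(seq) - len(site) + 1) if seq.startswith(site, i)
        ]
        if positions:
            found[enzyme] = positions
    return found


# Sense primer FVR137F as given in the text.
FVR137F = "ACCC CCCGGG GGCAACTTCACCTATCATTC"
```

Calling `sites_in(FVR137F)` reports a single XmaI site. Note that any NotI site also contains an EagI site, so primers carrying GCGGCCGC will be reported for both enzymes.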






Construction of the plasmids encoding fragments of αN-catenin fused to the GAL4-DBD

[0028] Several restriction fragments of pAS2ANCTN were subcloned into pGBT9 (Clontech, Palo Alto, CA;
Bartel et al., 1993) to construct plasmids encoding different parts of the αN-catenin protein fused to the GAL4 DNA
binding domain. An EcoRI-BamHI fragment (encoding residues 4-193), an EcoRI-PstI fragment (encoding residues
4-535), a PstI fragment (encoding residues 535-787) and an XhoI-SalI fragment (encoding residues 543-906) were
isolated and ligated in frame with the GAL4 DNA binding domain in the pGBT9 vector digested with the appropriate
restriction enzymes. The plasmids were designated pGBT9ANCTN(EcoRI-BamHI), pGBT9ANCTN(EcoRI-PstI),
pGBT9ANCTN(PstI 1618-2375) and pGBT9ANCTN(XhoI-SalI), respectively. The in-frame cloning of the fragments was
confirmed by DNA sequence analysis using the vector-specific forward primer 5'-ATCATCGGAAGAGAGTAGTA-3' (=
FVR175F; Table 2). The inserts were also sequenced with the vector-specific reverse primer (= FVR217R; Table 2)
5'-AAAATCATAAATCATAAGAA-3'.

Construction of the plasmid encoding αE-catenin fused to the GAL4-DBD

[0029] A human fetal kidney 5' Stretch cDNA library (Clontech, Palo Alto, CA) in vector λDR2 was screened with an
α-32P-labeled αE-catenin-specific probe. This resulted in the isolation of the pDR2αECTN plasmid (phages were
converted in vivo into the pDR2-derived plasmids according to the manufacturer's protocol), containing the
full-length cDNA of αE-catenin. The plasmid was digested with SalI, SphI and Eco47III. The SalI/SphI and the
Eco47III/SphI fragments were ligated into the SmaI/SalI digested pGBT9 vector. The two-hybrid plasmid was called
pGBT9αECTN and was analyzed for in-frame cloning by DNA sequence analysis using the vector-specific forward primer
FVR175F (Table 2). The insertion of the fragments was confirmed by sequencing the 3' end using the vector-specific
primer FVR217R (Table 2).



Construction of the plasmids encoding the amino-terminal part of ANC_2H01 fused to the GAL4-AD

[0030] The ANC_2H01 clone, isolated from the human kidney cDNA library by performing a two-hybrid screen
with αN-catenin as a bait, was digested with BamHI. The 700-bp fragment was isolated and subcloned into the BamHI
digested pGAD10 vector (Clontech, Palo Alto, CA). The cDNA insert encodes the amino-terminal part plus two of
the nine zinc fingers. The in-frame cloning was confirmed by DNA sequence analysis. We designated the plasmid
ANC_2H01/BamHI.

[0031] Using amplification by PCR with compatible primers containing an additional restriction site, we
subcloned parts of the ANC_2H01 cDNA encoding the amino-terminal part of the protein, lacking any functional
zinc finger. For this construct, PCR amplification from the initial ANC_2H01 two-hybrid clone was performed using
the forward primer 5'-GCACTATGGCCAGAAACAGAAATCAGA-3' (= FVR345F; Table 1), compatible with the reverse
primer 5'-G GAATTC TGGGCAGTCACATTCAAAG-3' (= FVR346R; Table 1), which included an EcoRI restriction site
(underlined). The amplified fragment was digested with XbaI/EcoRI. A second fragment was isolated as a BamHI/XbaI
fragment of ANC_2H01. Both fragments were ligated into the two-hybrid vector pGAD10 double digested with
BamHI/EcoRI. We named the final plasmid ANC_2H01/500.

[0032] Another construct, containing a cDNA fragment encoding the amino-terminal part of the initial ANC_2H01
plus half of the first zinc finger, was made using the same strategy. We amplified a PCR fragment from ANC_2H01
using the same forward primer (= FVR345F; Table 1), compatible with the reverse primer
5'-G GAATTC CATATGCTGCTTTAAGTCAG-3' (= FVR347R; Table 1), in which an EcoRI site (underlined) was included.
The plasmid was called ANC_2H01/600. The in-frame cloning and the insertion of the fragments were confirmed by
DNA sequence analysis, using primers FVR174F (Table 2) and FVR192R (Table 2) for each of the constructed
plasmids.

Construction of the plasmid for eukaryotic expression of epitope-tagged protein

[0033] We constructed an expression vector for eukaryotic expression of the ANC_2H01 full-length protein fused
to the E-tag. We used the NotI-KpnI fragment of the vector pEFHOBES (a kind gift from M. Van de Craen, Department
of Molecular Biology, VIB-University of Ghent, Belgium), consisting of the expression vector pEF-BOS (Mizushima and
Nagata, 1990), in which the E-tag, derived from plasmid pCANTAB5E (Pharmacia), is inserted. To construct the
expression vector pEFBOSANC_2H01E, which encodes the ANC_2H01 protein fused at its amino-terminal side to
the E-tag, a three-point ligation was set up. The first fragment was amplified from the cloned 5' RACE product (in
pGEMT) by using the forward primer 5'-ATAAGAAT GCGGCCGC TATGAATGAGTATCCTAAAA-3' (= FVR662F; Table
1), which contains a NotI restriction site (underlined), and a compatible reverse primer (= FVR663R; Table 1)
5'-CGGATACAGCATAGCGTAGAAAAGGCAGTGTGGTC-3'. The amplified fragment was digested with NotI and AlwNI.
The second fragment was obtained by digestion of the initial two-hybrid clone of ANC_2H01 with XhoI and NcoI and
ligation of this fragment into the XhoI-NcoI digested vector pJRD184 (Heusterspreute et al., 1985; John Davis,
personal communication). This clone was finally used for the isolation of the AlwNI-KpnI fragment of the ANC_2H01
cDNA. Both fragments were ligated into the NotI-KpnI fragment of the vector pEFHOBES. The in-frame cloning was
confirmed by DNA sequence analysis with the ANC_2H01-specific primer 5'-TCTGTTTCTGGCCTTGATTC-3'
(= FVR310R; Table 1).

[0034] Another three-point ligation was set up to fuse the ANC_2H01 protein carboxy-terminally to the E-tag. The
vector used was pDNATRADE (a gift from Dr. W. Declercq and B. Depuydt, Department of Molecular Biology,
VIB-University of Ghent, Belgium), which was digested with the NotI and HindIII restriction enzymes. The HindIII site
was completely filled in. The second fragment was amplified from the initial two-hybrid clone ANC_2H01 using the
forward primer 5'-ATCGTCAGCGACATAGGTCAATGGAATTTTCTCTGAT-3' (= FVR660F; Table 1) and the compatible reverse
primer 5'-ATAAGAAT GCGGCCGC TGTTGTCTCATGGACTGGAAG-3' (= FVR661R; Table 1), containing a NotI
restriction site (underlined). The obtained PCR fragment was digested with AlwNI and NotI restriction enzymes. The
third fragment was obtained from the digestion of pGEMTRACEIC with Bsu36I and AlwNI. The constructed plasmid
was called pDNA-ANC_2H01E.

Construction of plasmids encoding αN/αE-catenin chimeras fused to the GAL4-DBD

[0035] We constructed six different plasmids in which different parts of either αN-catenin cDNA or αE-catenin cDNA
were exchanged with the homologous part of, respectively, the αE-catenin cDNA or the αN-catenin cDNA. This was
done by a combination of PCR products and restriction fragments. A first fragment was amplified from the
pGBT9ANCTN plasmid by using the forward primer 5'-CG GAATTC CCGGGGGCAACTTC-3' (= FVR1241; Table 6),
which contains an EcoRI restriction site (underlined), and a compatible reverse primer
5'-TCATTAAGAG CATATG CCAGCT-3' (= FVR1243; Table 6), which includes a new NdeI restriction site (underlined). The
PCR product was digested with the appropriate restriction enzymes and subsequently ligated into the EcoRI-NdeI
digested pGBT9αECTN plasmid. This construct was named pGBT9αNCTNVH1E, referring to the vinculin-homologous
domain 1 (VH1) of αN-catenin that is replaced by that of αE-catenin (Fig. 10).

[0036] To construct pGBT9αECTNVH1N, a fragment was amplified from the pGBT9αECTN plasmid using the 
forward primer 5'-AATT CCCGGG CGCCCAGCTAGC-3' (= FVR1244; Table 6), comprising an XmaI restriction site 
(underlined), and a compatible reverse primer 5'-TCCTCCAGGGA CGGCCG AAAGC-3' (= FVR1245; Table 6), which 
includes an EagI restriction site (underlined). Subsequently, the amplified fragment was digested with the 
appropriate restriction enzymes and ligated into the fragment obtained by XmaI-EagI digestion of pGBT9ANCTN. The 
constructed plasmid is called pGBT9αECTNVH1N and contains the cDNA of αE-catenin in which the VH1 domain 
of αE-catenin is replaced by that of αN-catenin (Fig. 10). 

[0037] The vinculin-homologous domain 2 of αN-catenin (VH2N) was amplified from the pGBT9ANCTN plasmid 
using the forward primer 5V\GGTTCCGGCCGTCCCTGCA-3' (= FVR1246; Table 6) and a compatible reverse primer 
5'-GGAATATC GGTACC TGCTCAGC-3' (= FVR1247; Table 6), including, respectively, an EagI and a KpnI restriction 
site (underlined). A second fragment containing the vinculin-homologous domain 3 of αE-catenin (VH3E) was generated 
by PCR from the pGBT9αECTN plasmid, using the forward primer 5'-AACAGGTACCCAGCTTCCAGG-3' (= FVR1252; 
Table 6), which includes a KpnI restriction site (underlined), and the reverse primer 5'-CTTGGCTGCAG GTCGAC TCT-3' 
(= FVR1253; Table 6), containing a SalI restriction site (underlined). These two PCR fragments were digested with 
the appropriate restriction enzymes and subsequently ligated into the EagI-SalI digested pGBT9αNCTNVH1E. The 
constructed plasmid contains the cDNA of αE-catenin in which the VH2 is replaced by that of αN-catenin and is 
therefore named pGBT9αECTNVH2N (Fig. 10). These same two digested PCR fragments were ligated into the EagI-SalI 
digested pGBT9ANCTN plasmid to obtain the pGBT9αNCTNVH3E plasmid, in which the VH3 domain of αN-catenin 
is replaced by that of αE-catenin (Fig. 10). 

[0038] A PCR fragment was generated from the pGBT9αECTN plasmid using a forward primer 
5'-ACTGGCATATGCACTCAATAAC-3' (= FVR1250; Table 6), containing an NdeI restriction site (underlined), and a 
compatible reverse primer 5'-CCTGGAAGCTG GGTACC TGTTC-3' (= FVR1251; Table 6), including a KpnI restriction 
site (underlined). A second fragment was generated by PCR from pGBT9ANCTN using the forward primer 
5'-GCTGAGCA GGTACC GATATTCC-3' (= FVR1248F; Table 6) and a reverse primer 
5'-TTGGCTGCAG GTCGAC GGTATC-3' (= FVR1249R; Table 6), including, respectively, a KpnI and a SalI restriction 
site (underlined). These two PCR fragments were digested with the appropriate restriction enzymes and ligated into 
the NdeI-SalI digested pGBT9αECTNVH1N plasmid to obtain pGBT9αNCTNVH2N (Fig. 10). To construct the 
pGBT9αECTNVH3N plasmid, these same two PCR fragments were ligated into the NdeI-SalI digested pGBT9αECTN 
(Fig. 10). 

[0039] The in-frame cloning of all the different fragments was confirmed by DNA sequence analysis of the 
constructs, using primers FVR51, FVR54, FVR157, FVR160, FVR217, FVR738, FVR1157 and FVR1311 (Table 6). 
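
The primers described for the chimera constructs each carry an engineered restriction site. As a quick sanity check before ordering or reusing such oligonucleotides, one could confirm that every primer contains the recognition sequence of the enzyme named for it. The following is a minimal Python sketch of that check. The recognition sequences are the standard ones for these enzymes; the primer sequences are copied from the text with the underlining spaces removed; the function and dictionary names are illustrative assumptions and not part of the original protocol.

```python
# Standard recognition sequences (5'->3') for enzymes named in the text.
# All of them are palindromic, so primer orientation does not matter here.
RECOGNITION_SITES = {
    "EcoRI": "GAATTC",
    "XmaI": "CCCGGG",
    "EagI": "CGGCCG",
    "KpnI": "GGTACC",
    "SalI": "GTCGAC",
}

# Primer sequences as printed in the text (spaces only marked the underlining).
PRIMERS = {
    "FVR1241": ("CG GAATTC CCGGGGGCAACTTC", "EcoRI"),
    "FVR1244": ("AATT CCCGGG CGCCCAGCTAGC", "XmaI"),
    "FVR1245": ("TCCTCCAGGGA CGGCCG AAAGC", "EagI"),
    "FVR1247": ("GGAATATC GGTACC TGCTCAGC", "KpnI"),
    "FVR1253": ("CTTGGCTGCAG GTCGAC TCT", "SalI"),
}


def has_site(primer: str, enzyme: str) -> bool:
    """Return True if the primer contains the enzyme's recognition sequence."""
    cleaned = primer.upper().replace(" ", "")
    return RECOGNITION_SITES[enzyme] in cleaned


def check_primers(primers: dict) -> dict:
    """Map each primer name to whether its expected site is present."""
    return {name: has_site(seq, enzyme) for name, (seq, enzyme) in primers.items()}


if __name__ == "__main__":
    for name, ok in check_primers(PRIMERS).items():
        print(f"{name}: {'site present' if ok else 'site MISSING'}")
```

A check of this kind only confirms that the site is present in the oligonucleotide; it says nothing about whether the enzyme also cuts elsewhere in the amplified fragment or the vector.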

Construction of plasmids encoding nearly full-length ANC_2H01 fused to either the GAL4-DBD or GAL4-AD. 

[0040] To construct the two-hybrid plasmid encoding nearly the full-length ANC_2H01 protein fused to the 
GAL4-DBD, a three-point ligation was set up. The first fragment was amplified from the cloned 5' RACE product (in 
pGEMT) by using the forward primer 5'-GGAATTCCTGAATGAGTATCCTAAAAAA-3' (= FVR1237F; Table 1), which 
contains a new EcoRI restriction site (underlined), and a compatible reverse primer 
5'-ATGCATGCTGTAGAAAAGGCAGTGTGGT-3' (= FVR1238R; Table 1). The amplified fragment was digested with EcoRI 
and AlwNI. A second fragment was generated by PCR from the initial two-hybrid clone ANC_2H01 using the forward 
primer 5'-CGTCGCGGCCCTGCAGATGGATTCAATGGA-3' (= FVR1240F; Table 1) and a compatible reverse primer 
5'-TCCC CCCGGG GGGATGAATTTATTATTTTA-3' (= FVR1242R; Table 1), extended with an XmaI restriction site 
(underlined). The latter PCR fragment was digested with AlwNI and XmaI. Both digested PCR fragments were ligated 
into the EcoRI-XmaI fragment of the vectors pGBT9 and pGAD424. These constructed plasmids were called 
pGBT9ANC_2H01FL and pGAD424ANC_2H01FL, respectively. The in-frame cloning of the fragments was confirmed 
by DNA sequence analysis for both plasmids. 

Construction of plasmids encoding either the aminoterminal or the carboxyterminal part of ANC_2H01 fused to the 
GAL4-DBD. 

[0041] A fragment was amplified from the expression vector pEFBOSANC_2H01E using the forward primer 
5'-TCC CCCGGG TATGAATGAGTATCCTAAAAAA-3' (= FVR1411F; Table 1), extended with an XmaI restriction site 
(underlined), and the compatible reverse primer 5'-AAAA GTCGAC GGCCACTGCTATTAGCTCTC-3' (= FVR1412R; 
Table 1), extended with a SalI restriction site (underlined). The fragment was digested with the XmaI and SalI 
restriction enzymes and ligated into the XmaI-SalI digested pGBT9 vector. The constructed two-hybrid plasmid encodes 
the aminoterminal, non-zinc finger part of ANC_2H01 fused to the GAL4-DBD and is called pGBT9ANC_2H01-AT. 

[0042] In order to fuse the carboxyterminal, zinc finger part of ANC_2H01 to the GAL4-DBD, another fragment was 
amplified from the pDNA-ANC_2H01E plasmid, using the forward primer 5'-G GAATTC TTCTATAAATGTGAACTTTGT-3' 
(= FVR1413F; Table 1), which includes an EcoRI restriction site, and the compatible reverse primer 
5'-AAAAGTCGACAAGTTAAAGAGAATAATCAA-3' (= FVR1414R), extended with a SalI restriction site. This PCR 
fragment was digested with the EcoRI and SalI restriction enzymes and subsequently ligated into the EcoRI-SalI 
fragment of pGBT9. This plasmid was named pGBT9ANC_2H01-ZF. 
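
Because these fusions depend on in-frame cloning, it can help to translate the gene-specific part of a primer and see which residues it encodes. The sketch below does this in Python for the forward primer FVR1413F, using the standard genetic code. It assumes, for illustration only, that the reading frame starts directly after the EcoRI site (GAATTC); the text does not state the frame explicitly, and the names `translate` and `FVR1413F_GENE_PART` are hypothetical.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring spaces and any
    incomplete trailing codon. Stop codons are written as '*'."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Bases of FVR1413F that follow the EcoRI site, as printed in the text.
# ASSUMPTION: the reading frame begins at the first of these bases.
FVR1413F_GENE_PART = "TTCTATAAATGTGAACTTTGT"

if __name__ == "__main__":
    print(translate(FVR1413F_GENE_PART))
```

Under that assumed frame the primer encodes F-Y-K-C-E-L-C, that is, two cysteines separated by two residues, which is what one would expect at the start of a Cys2His2 zinc finger.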

Yeast strains and media 

[0043] The Saccharomyces cerevisiae strain HF7c (MATa, ura3-52, his3-Δ200, ade2-101, lys2-801, trp1-901, 
leu2-3,112, gal4-542, gal80-538, lys2::GAL1-HIS3, URA3::(GAL4 17-mers)-CYC1-LacZ) (Matchmaker™, Clontech, Palo 
Alto, CA) was used for most assays. The HF7c yeast strain carries two reporter genes, HIS3 and LacZ, both integrated 
into the yeast genome and under the control of GAL4 responsive elements, respectively, the GAL1 UAS and the UAS 
G-17mer. It also has two auxotrophic markers, trp1 and leu2, which are used for plasmid selection upon 
transformation. Yeast cultures were grown at 30°C in either complete YPD medium (1% yeast extract, 2% peptone and 
2% glucose) or in SD minimal medium (0.5% yeast nitrogen base without amino acids, 2% glucose, and 1% of the 
appropriate amino acid drop out solution). 
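
The medium compositions above are given as percentages. As a worked illustration, the following Python sketch converts the YPD percentages into grams for a chosen batch volume. It assumes that the percentages are weight per volume (1% = 1 g per 100 ml), which the text does not state explicitly; the function names are illustrative.

```python
# YPD composition from the text, ASSUMED to be % (w/v).
YPD_PERCENT_WV = {"yeast extract": 1.0, "peptone": 2.0, "glucose": 2.0}


def grams_needed(percent_wv: float, volume_ml: float) -> float:
    """Grams of solute for a % (w/v) solution: 1% (w/v) is 1 g per 100 ml."""
    return percent_wv * volume_ml / 100.0


def recipe(percentages: dict, volume_ml: float) -> dict:
    """Grams of each component for the requested batch volume."""
    return {name: grams_needed(pct, volume_ml) for name, pct in percentages.items()}


if __name__ == "__main__":
    for name, grams in recipe(YPD_PERCENT_WV, 500).items():
        print(f"{name}: {grams:g} g per 500 ml")
```

The drop out solution in SD medium is a liquid supplement, so its 1% is more plausibly volume per volume and is deliberately left out of this sketch.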

[0044] The Saccharomyces cerevisiae strain Y190 (MATa, ura3-52, his3-200, ade2-101, lys2-801, trp1-901, 
leu2-3,112, gal4Δ, gal80Δ, cyhr2, LYS2::GAL1 UAS-HIS3 TATA-HIS3, URA3::GAL1 UAS-GAL1 TATA-lacZ) (Matchmaker™, 
Clontech, Palo Alto, CA) was used to perform control experiments. The Y190 strain exhibits a significant level of 
constitutive leaky expression of the HIS3 reporter gene. This background can be repressed by including 40 mM 
3-aminotriazole (3-AT) in the medium. This strain also contains an integrated LacZ reporter gene under the control of 
the GAL4 responsive elements in the GAL1 UAS and GAL1 minimal promoter. This results in a high level of LacZ 
expression when induced by a positive two-hybrid interaction. In vivo assays on agar plates can be performed with this 
strain, unlike strain HF7c, for which colony transfer to filters is needed (see below). 

Yeast transformation and β-galactosidase filter assay 

[0045] Plasmids encoding the GAL4 hybrid proteins were introduced into the HF7c yeast reporter strain by the 
lithium acetate (LiAc) transformation procedure (Gietz et al., 1992; MATCHMAKER Yeast Protocol Handbook). 
Transformants were selected for the presence of the plasmids by growing on appropriate media at 30°C. They were 
allowed to grow until the colonies were large enough to perform a β-galactosidase filter assay, usually 3-4 days. The 
transformed cells were then transferred onto an 82-mm nitrocellulose membrane (Sartorius, Goettingen, Germany) and 
permeabilized by freezing the membranes in liquid nitrogen for one minute, followed by thawing at room 
temperature. The membranes were soaked with 1.5 ml of Z-buffer containing 5-bromo-4-chloro-3-indolyl-β-D-galactoside 
(X-gal) and incubated at 30°C until the appearance of blue colonies. This usually takes 30 min to 12 
hours. The membranes were then dried, analyzed and stored. 

[0046] Using the Y190 strain, the β-galactosidase assay was done in vivo, on agar plates containing SD minimal 
medium lacking Trp, Leu and His (1.5% agar, 0.5% yeast nitrogen base without amino acids, 2% glucose, and 1% of 
the appropriate amino acid drop out solution), supplemented with 0.07 M potassium phosphate pH 7, 40 mM 3-AT and 
X-gal (80 mg/ml). 

Two-hybrid cDNA library screening and DNA sequence analysis 

[0047] The plasmid pAS2ANCTN was used to screen a human kidney cDNA library, cloned in the GAL4 activation 
domain vector pGAD10 (Clontech, Palo Alto, CA). The plasmids were introduced into the HF7c yeast strain by using 
sequential transformation by the lithium acetate (LiAc) method described by Gietz et al. (1992). The interaction 
screen was carried out in the yeast strain HF7c on media lacking leucine, tryptophan and histidine, and containing 5 mM 
3-aminotriazole (3-AT). The β-galactosidase activity in yeast was measured using a β-galactosidase filter assay. 

[0048] Yeasts harboring interacting proteins were used for plasmid isolation. The obtained plasmid mixture was 
transformed into Escherichia coli HB101 electrocompetent cells. HB101 has a defect in the leuB gene which can be 
complemented by LEU2 from yeast. This strain can therefore be used for selection of the library plasmid, which carries the 
yeast LEU2 transformation marker. From these transformants, the library plasmids were isolated and introduced into 
Escherichia coli DH5α. Plasmids isolated from this strain were further characterized by DNA sequencing. This was done 
by the dideoxy chain termination method (Sanger, 1981) using fluorescent dye terminators in a 373A or a 377 
automated DNA sequencer (Applied Biosystems, Foster City, CA). The sequences were further analyzed by BLAST 
search (Altschul et al., 1990) and the DNAstar software packages (DNASTAR Inc., Madison, USA), and by the Staden 
gap4 software (Bonfield et al., 1995). To sequence the cDNA insert of the library plasmid, two primers were designed 
on the pGAD10 vector sequence flanking the cDNA insert: a forward primer 5'-ACCACTACAATGGATGATGT-3' 
(= FVR174F; Table 2) and a reverse primer 5'-TAAAAGAAGGCAAAACGATG-3' (= FVR192R; Table 2). 

Northern blot analysis 

[0049] The total length of the complete mRNA encoding the zinc finger protein was estimated by Northern blot 
analysis. Total RNA of various human cell lines was isolated by using RNA-zol B (Wak Chemie-Medical, Bad 
Homburg, Germany). Total RNA (15 µg) was glyoxylated, size-fractionated on a 1% agarose gel and transferred to a 
Hybond-N+ membrane (Amersham). The probe was radioactively labeled using random priming (RadPrime DNA 
labeling system; Gibco BRL Life Technologies, Gent, Belgium). The probe used was a 700-bp BamHI fragment of the 
isolated two-hybrid clone ANC_2H01, and covered the amino-terminal part and two of the 9 zinc fingers. 
Hybridizations were performed as described elsewhere (Bussemakers et al., 1991). 

Multiple tissue RNA dot blot analysis 

[0050] A human RNA Master Blot™ was purchased from Clontech and hybridized according to the 
manufacturer's instructions. The probe was a 700-bp BamHI restriction fragment of the originally isolated 
two-hybrid clone ANC_2H01. The labeling was performed using the Strip-EZ™ DNA labeling kit (Ambion). Hybridization 
conditions were as described for the Northern blot analysis. The ExpressHyb™ hybridization solution was purchased 
from Clontech. The blot was exposed for 7 days to a P-imager screen (Molecular Dynamics). 

Western blot analysis 

[0051] Cells were lysed by boiling in sample buffer in the presence of 5% β-mercaptoethanol (Laemmli, 1970). The 
proteins were fractionated by SDS-PAGE and transferred to an Immobilon-P membrane (Millipore, Bedford, MA). The 
blot was then blocked with 5% nonfat dry milk in PBS A containing 0.01% Tween-20. This was followed by 
incubation with the primary antibody for 3 h and, after extensive washing, an alkaline phosphatase-conjugated 
secondary antibody was added for another 3-h incubation. Finally, the substrate for the secondary antibody, NBT/BCIP 
(nitroblue tetrazolium plus 5-bromo-4-chloro-3-indolyl phosphate), was applied to visualize the specific 
proteins on the membrane. The staining reaction was stopped by rinsing the blot with water. 

Antibodies 

[0052] The following antibodies were used in Western blot and immunocytochemistry experiments. The monoclonal 
antibody directed against the E-tag was purchased from Pharmacia and was used in a 1/300 dilution for Western blot 
analysis and a 1/1000 dilution for immunocytochemistry. The secondary antibody used for Western blot analysis was 
the anti-mouse alkaline phosphatase-conjugated antibody (dilution 1/500; Sigma Chemical Company, St. Louis). We 
used an anti-mouse Cy5-conjugated antibody as a secondary antibody for immunostaining (dilution 1/200, Amersham 
Pharmacia Biotech, UK). 

[0053] The polyclonal antibody directed against αN-catenin was purchased from Santa Cruz Biotechnology and 
used for Western blot analysis (dilution 1/100) as well as for immunocytochemistry (dilution 1/200, Santa Cruz 
Biotechnology, Inc., California). The secondary antibody applied in Western blot analysis was the anti-goat alkaline 
phosphatase-conjugated antibody (dilution 1/7500, Sigma Chemical Company, St. Louis). For immunocytochemistry, we 
used the anti-goat FITC-conjugated antibody (dilution 1/75, Chemicon International Inc., Temecula, CA). 

[0054] The polyclonal antibody directed against human and mouse αE-catenin was used for Western blot analysis 
(dilution 1/4000, Sigma Chemical Company, St. Louis). The secondary antibody used was the anti-rabbit alkaline 
phosphatase-conjugated antibody (dilution 1/5000, Sigma). 

[0055] To detect E-cadherin in HEK293T cells by Western blot analysis, we applied the mouse HECD1 
anti-E-cadherin monoclonal antibody from Zymed (dilution 1/250, Zymed Laboratories, Inc., San Francisco, CA) followed by 
the secondary anti-mouse alkaline phosphatase-conjugated antibody (dilution 1/5000, Sigma). 

[0056] The anti-Pan-cadherin antibody (dilution 1/500, Sigma), recognizing at least E-cadherin, N-cadherin, P- 
cadherin, VE-cadherin, R-cadherin and T-cadherin, was also used in Western blot analysis. The secondary antibody 
used was the anti-rat alkaline phosphatase-conjugated antibody (dilution 1/5000, Sigma). 

[0057] For the detection of the GAL4 DNA binding domain fusion proteins, a rabbit anti-GAL4 DNA binding domain 
antiserum was applied (dilution 1/1000; UBI). For the detection of the GAL4 activation domain fusion proteins, a 
rabbit anti-GAL4 activation domain antiserum was used (dilution 1/1000, UBI). The secondary antibody used in Western 
blot analysis was anti-rabbit alkaline phosphatase-conjugated antibody (dilution 1/5000, Sigma). 
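
The antibody dilutions listed above translate directly into pipetting volumes. The following Python sketch shows the arithmetic, assuming that a dilution written as 1/N means one part antibody stock in N parts final volume (a common reading, but an assumption here). The function name is illustrative.

```python
from fractions import Fraction


def stock_volume_ul(dilution: str, final_volume_ml: float) -> float:
    """Microlitres of antibody stock needed for a '1/N' dilution.

    ASSUMPTION: '1/N' means 1 part stock in N parts total volume.
    """
    fraction = Fraction(dilution)  # e.g. "1/300" -> Fraction(1, 300)
    return float(fraction * Fraction(final_volume_ml) * 1000)


if __name__ == "__main__":
    # Example volumes are arbitrary; only the dilutions come from the text.
    print(f"1/300 in 10 ml: {stock_volume_ul('1/300', 10):.1f} µl stock")
    print(f"1/5000 in 10 ml: {stock_volume_ul('1/5000', 10):.1f} µl stock")
```

For example, a 1/1000 dilution in 5 ml of buffer requires 5 µl of stock under this reading.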

5' RACE 

[0058] In order to complete the cDNA of the clone isolated by the two-hybrid screen, we used 5' RACE technology 
(GIBCO BRL, Life Technologies, Gent, Belgium). The missing 5' fragment was isolated using a gene-specific primer 
5'-GCGGTTCTTCATCAGTTTGG-3' (GSP1 = FVR359R; Table 1) to synthesize the first strand of the cDNA. We 
performed a PCR with the primer set GSP2 5'-CTCTTGGGTTTGCTGGTTGA-3' (= FVR360R; Table 1) and anchor 
primer 5'-GAATTCGTCGACTAGTACGGGIIGGGIIGGGIIG-3' (= FVR239F; Table 3), followed by a nested PCR with 
the gene-specific primer GSP2 (= FVR360R; Table 1) and primer UAP 5'-GAATTCGTCGACTAGTAC-3' (= FVR240F; 
Table 3). This yielded 3 fragments from human mammary gland mRNA and 2 from human uterus mRNA. No fragments 
could be amplified from human fetal brain mRNA or human small intestine mRNA. The fragments were further 
characterized by cloning into the pGEM®-T cloning system (Promega, Madison, WI) and by subsequent DNA sequence 
analysis using the M13 forward (5'-CGCCAGGGTTTTCCCAGTCACGAC-3'; FVR283; Table 5) and M13 reverse primer 
(5'-TCACACAGGAAACAGCTATGAC-3'; FVR284; Table 5). The specificity of the fragments was determined using the 
DNAstar software packages (DNASTAR Inc., Madison, USA), and by the Staden gap4 software (Bonfield et al., 1995). 
The RNA from normal tissue was purchased from Clontech (Palo Alto, CA). 

Transfection procedure of HEK293T cells 

[0059] HEK293T cells (Graham et al., 1977; Wigler et al., 1978) were transiently transfected with the pPNhANCTN 
and/or the pEFBOSANC_2H01E plasmid by a Ca3(PO4)2 transfection procedure. The cultured HEK293T cells were 
trypsinized and reseeded at 300,000 cells/ml 24 h before transfection. Two hours before transfection, fresh medium 
was added to the cells. For a 6-well plate, 2 µg of plasmid DNA per well was used, while for a 25-cm² culture flask 
5 µg of plasmid DNA was applied. The sterile, ethanol-precipitated plasmid DNA (purified on a Qiagen DNA 
purification column, Qiagen Inc., CA, USA) was dissolved in 0.1x TE (Tris-EDTA) buffer, pH 7.5. The appropriate 
amount of plasmid DNA was mixed with an equal volume of a mixture comprising 1 volume 125 mM CaCl2/HEPES, pH 
7.05 and 4 volumes 0.1x TE. The DNA mixture was added very slowly to 1x HEPES/2x BS (buffered saline). Upon 
mild shaking, the DNA-Ca3(PO4)2 precipitate was formed and could then be added to the medium covering the cells. 
The cells were incubated at 37°C for 24 h. The DNA-transfection mixture was then removed and replaced by 
fresh medium consisting of DMEM, supplemented with 10% FCS, 0.03% glutamine, 100 U/ml penicillin, 100 mg/l 
streptomycin and 0.4 mM sodium pyruvate. 

Immunocytochemistry 

[0060] HEK293T cells were reseeded and grown on glass coverslips and transfected with either the plasmid 
encoding αN-catenin, the plasmid encoding the E-tagged ANC_2H01 protein, or both plasmids. The cells 
were incubated according to the transfection protocol. When confluent monolayers were formed, cells were fixed with 
ice-cold methanol for 15 min at -20°C. The cells were washed with PBS A. Subsequently, the cells were incubated for 
1 h at 37°C with either a polyclonal antibody against αN-catenin (goat anti-αN-catenin, Santa Cruz Biotechnology), 
an antibody against the E-tag (Pharmacia), or both. After extensive washing with PBS A, the appropriate FITC- or 
Cy5-labeled secondary antibodies were applied for 1 h at 37°C. All antibodies applied were diluted in PBS A containing 
0.04% gelatin. Finally, coverslips with cells were mounted with Vectashield (Vector Laboratories, Burlingame, CA). 
The cells were examined with a Zeiss LSM 410 confocal laser-scanning immunofluorescence microscope (Carl Zeiss, 
Jena, Germany). 

Raising of polyclonal and monoclonal antibodies against ANC_2H01 protein 

[0061] We are raising a polyclonal antibody against the ANC_2H01 protein by immunization of rabbits with an 
ANC_2H01-specific synthetic peptide that was synthesized and purified by the VIB-Department of Medical Protein 
Chemistry. This peptide corresponds to aa residues 73-87 (sequence NH2-DGKARNRNQNYLVP-COOH) and was 
chosen on the basis of the Protean program of the DNAstar software packages (DNASTAR Inc., Madison, USA). The 
peptide was extended with an additional cysteine residue at the carboxyterminus. Via this cysteine residue, the 
peptide was conjugated to keyhole limpet hemocyanin (KLH, Sigma). Three rabbits were each immunized with 200 µg 
peptide, mixed with complete Freund's adjuvant. The results of these immunizations are not yet available. 

[0062] Monoclonal antibodies will be generated using GST-fusion proteins. Several plasmids containing different 
fragments of ANC_2H01 fused to the GST cDNA have been constructed. In a first approach, the aminoterminal non-zinc finger 
part was fused to GST and will be used to generate monoclonal antibodies. 

PAC Screening 

[0063] The isolation of a human genomic DNA clone comprising the ANC_2H01 gene was done by PCR screening 
of the PAC clone library RPCI1 (Ioannou and de Jong, 1996). Pools of human PAC library clones were obtained from the 
UK HGMP Resource Center (Hinxton, Cambridge, UK) and were screened by PCR, using primers FVR513F (Table 1) 
and FVR514R (Table 1). A 338-bp PCR fragment was amplified on ANC_2H01 cDNA. Out of these pools of human 
PAC library clones, one positive clone was identified. An aliquot of this clone was streaked out on an LB-agar 
plate containing kanamycin, and single colonies were re-examined by PCR. A positive colony was then grown and used 
for DNA isolation. 

Fluorescence In Situ Hybridization analysis (FISH) 

[0064] FISH analysis using PAC clone 167024 specific for the ANC_2H01 gene was performed according to 
standard procedures (Kievits et al., 1990) with some minor modifications. The DNA of the PAC clone was biotinylated 
using a BioNick kit (Gibco BRL) according to the manufacturer's protocol. Fluorescence image results were captured 
by a Photometrics Image Point CCD camera (Photometrics GmbH, München, Germany) mounted on a Zeiss Axiophot 
microscope (Carl Zeiss, Jena, Germany). Image processing was performed and chromosome G-banding was obtained 
by reverse DAPI-banding using the MacProbe V3.4.1 software (Perceptive Scientific International LTD., League City, 
TX, USA). 

CASTing 

[0065] CASTing (Cyclic Amplification and Selection of Target sequences; Wright et al., 1991) will be used to 
determine any DNA sequences to which the ANC_2H01 protein binds specifically. The first step is the construction of a 
plasmid to obtain the protein of interest fused to GST. Using this GST part, the protein can be immobilized on a 
glutathione column. We have cloned the zinc finger domain of ANC_2H01 into the pGEX4T vector (Pharmacia). 

RESULTS 

Two-hybrid cDNA library screening 

[0066] For the initial two-hybrid screen, almost the full-length αN-catenin cDNA was fused to the GAL4 DNA 
binding domain in the pAS2 vector (Clontech) and assayed for interaction with proteins encoded by a GAL4 activation 
domain cDNA library from human kidney (Clontech). About 3x10⁵ clones were screened and examined for interaction 
with αN-catenin on the basis of induction of two genes: the selection gene HIS3 and the reporter gene LacZ. One 
clone that exhibited the desired HIS3-positive, β-galactosidase-positive phenotype was isolated from this screen 
and was further examined. The clone contained an insert of about 2,500 bp (Fig. 2). The cDNA insert of the clone was 
completely sequenced using universal and specific walking primers (see Table 1). The cDNA insert revealed an open 
reading frame (ORF) of 459 amino acids. The insert contained a stop codon, a poly-A signal and a poly-A tail, but no 
start codon could be detected. Analysis of the encoded protein revealed the presence of 9 Cys2His2 zinc fingers in 
the carboxy-terminal half of the protein (Figs. 1 to 3). Zinc fingers of the Cys2His2 type, originally discovered in 
TFIIIA (Hanas et al., 1983; Miller et al., 1985), often function in nucleic acid binding and, more particularly, in 
sequence-specific recognition of DNA, which is pivotal for the function of transcription factors. The 3' 
untranslated region (UTR) contained an Alu repeat region of 286 nucleotides (Figs. 1 and 2). The two-hybrid clone 
isolated in this screen was called ANC_2H01 (for Alpha-N-catenin 2-hybrid clone 01). 

Retransformation of ANC_2H01 in the two-hybrid system 

[0067] The primary purpose of this test is to check the specificity of the interaction between ANC_2H01 and 
αN-catenin. The isolated two-hybrid clone was retransformed into the HF7c strain. This ANC_2H01 clone was combined 
with one of the following: the original bait plasmid pAS2ANCTN, the empty vector pAS2, the pGBT9ANCTN 
plasmid, the empty vector pGBT9, or pLAM5', a plasmid encoding an irrelevant protein (lamin C) fused to the GAL4 
DNA binding domain. ANC_2H01 exhibited the desired HIS3-positive, β-galactosidase-positive phenotype upon 
combination with either pAS2ANCTN or pGBT9ANCTN. No interaction was seen upon combination with either of 
the empty vectors pGBT9 or pAS2, nor with pLAM5'. Therefore, the initial observation was confirmed, 
which suggests that the interaction between ANC_2H01 and αN-catenin is specific. 

Delineation of the domains mediating the interaction between ANC_2H01 and αN-catenin proteins 

[0068] Using the two-hybrid system, we tried to determine which part of the ANC_2H01 protein is 
responsible for the interaction with αN-catenin. Three different fragments of the initial two-hybrid clone ANC_2H01 
were subcloned in the pGAD10 vector and were designated, respectively, ANC_2H01/BamHI, ANC_2H01/600 and 
ANC_2H01/500 (Fig. 4). These three cDNA fragments each include the amino-terminal part of ANC_2H01. No 
interactions, however, could be observed between these different parts of ANC_2H01 and the αN-catenin protein 
(Fig. 4). 

[0069] By cloning different parts of αN-catenin into the pGBT9 vector, we delineated the domain responsible for 
association with ANC_2H01 to amino acid residues 4-537 of αN-catenin. The transformants harboring the plasmid 
pGBT9ANCTN(EcoRI-PstI) together with the ANC_2H01 plasmid exhibited the HIS3-positive, β-galactosidase-positive 
phenotype in the two-hybrid system (Fig. 4). All other combinations of αN-catenin fragments and ANC_2H01 did not 
result in the expression of the two reporter genes. Besides the interaction with the initial ANC_2H01 clone, we also 
tested the various αN-catenin fragments in the two-hybrid system for interaction with ANC_2H01/BamHI, 
ANC_2H01/600 and ANC_2H01/500. However, we could not observe any interaction between the different fragments 
of αN-catenin and any of the truncated ANC_2H01 fusion proteins (Fig. 4). 

[0070] As αE-catenin is an isoform of αN-catenin, the cDNA for αE-catenin was cloned in frame with the GAL4 
DNA binding domain into the pGBT9 two-hybrid vector. This clone was then assayed for interaction with ANC_2H01 
and truncated derivatives in the two-hybrid system. However, no interaction was observed when both plasmids were 
cotransformed into the HF7c strain (Fig. 4). 

Northern Blot Analysis 

[0071] In order to estimate the length of the complete mRNA of ANC_2H01, we performed a Northern blot analysis. 
Total RNA of various human cell lines was hybridized with a ³²P-labeled 700-bp BamHI fragment of the ANC_2H01 
cDNA. A very weak signal could be detected in some cell lines, but most of the lanes lacked specific signal. However, 
a stronger signal was seen in the lanes with GLC34 and GLC8 RNA (Fig. 5). The positions of 28S and 18S ribosomal 
RNA on the blot were visualized using methylene blue staining. Using these positions as markers, we could estimate 
the size of the ANC_2H01 mRNA to be about 3,000 bp. The cDNA insert of the original ANC_2H01 two-hybrid clone 
was about 2,500 bp, and contained a stop codon, a poly-A signal and a poly-A tail (Fig. 2). However, a start codon 
could not be detected. This implies that we were lacking a 5' fragment of about 500 bp. 

Multiple tissue RNA dot blot analysis 

[0072] To survey tissue-specific expression of ANC_2H01, a multiple tissue RNA dot blot analysis was performed 
with a commercially available human RNA dot blot (Clontech). The ANC_2H01 mRNA showed ubiquitous expression 
(Fig. 6). However, the strongest expression was observed for pituitary gland and adrenal gland mRNA. Lung, placenta, 
fetal liver and fetal lung also showed increased expression of the mRNA. 

5' RACE 

[0073] To isolate the missing 500-bp 5' fragment of the ANC_2H01 cDNA, we used the 5' RACE technology (Gibco 
BRL). For this purpose, we used human mRNAs derived from mammary gland, fetal brain, uterus and small intestine 
(Clontech). Using a gene-specific primer, an ANC_2H01-specific cDNA strand was synthesized. Using two nested 
primer sets (set #1 and set #2; see Table 1), we isolated 5 fragments from mammary gland and uterus mRNA. No 
fragments were amplified using the fetal brain or the small intestine mRNA. The 5' RACE fragments were cloned into 
the pGEM®-T cloning system and characterized by DNA sequence analysis. One of the inserts derived from mammary 
gland mRNA turned out to be specific for ANC_2H01. The four other 5' RACE products were nonspecific. From further 
sequence analysis using walking primers (see Table 1), the size of the 5' RACE fragment was determined to be 823 bp 
(Fig. 2). This 823-bp fragment contained 520 bp of new 5' sequence and 303 bp contained within the initial 
ANC_2H01 two-hybrid clone (Fig. 2). The 5' RACE fragment contained a start codon and 26 more codons that were not 
covered by the original ANC_2H01 two-hybrid clone (Fig. 1). In addition to the Alu repeat region detected in the 
3'UTR of the cDNA, the 5'UTR also contains an Alu repeat region (Fig. 1 and Fig. 2). 
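
The size bookkeeping behind the 5' RACE result can be checked with simple arithmetic. The Python sketch below uses only figures quoted in the text: the approximate mRNA size from the Northern blot, the approximate insert size of the original clone, the sizes of the RACE fragment parts, and the ORF lengths. The variable and function names are illustrative, and the Northern and insert sizes are approximations, so only rough agreement should be expected for the first value.

```python
# Figures quoted in the text (the first two are approximate estimates).
NORTHERN_MRNA_ESTIMATE_BP = 3000
ORIGINAL_CLONE_INSERT_BP = 2500
RACE_NEW_5PRIME_BP = 520
RACE_OVERLAP_BP = 303
ORIGINAL_ORF_AA = 459
EXTRA_CODONS = 1 + 26  # start codon plus 26 further codons from 5' RACE


def summarize() -> dict:
    """Collect the derived values that the text relies on."""
    return {
        # Northern estimate minus clone insert: expected missing 5' sequence
        "expected_missing_bp": NORTHERN_MRNA_ESTIMATE_BP - ORIGINAL_CLONE_INSERT_BP,
        # New sequence plus overlap should give the full RACE fragment size
        "race_fragment_bp": RACE_NEW_5PRIME_BP + RACE_OVERLAP_BP,
        # Original partial ORF plus the newly found codons
        "full_orf_aa": ORIGINAL_ORF_AA + EXTRA_CODONS,
    }


if __name__ == "__main__":
    for key, value in summarize().items():
        print(f"{key}: {value}")
```

The roughly 500 bp predicted from the Northern blot agrees well with the 520 bp of new sequence actually recovered, and the extended ORF length matches the 486-residue protein reported for the full cDNA.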

The ANC_2H01 gene product is a zinc finger protein 

[0074] Scrutiny of the entire sequence of the ANC_2H01 cDNA revealed several interesting features. The cDNA 
has a total length of 3,013 bp. The 5'UTR is 445 nt long. The context of the start codon matches an adequate 
Kozak initiator sequence (Kozak, 1996). The 3' region contains a poly-A signal and a poly-A tail, indicating that 
the 3'UTR is also completely isolated; it has a size of 1,112 nt. The 3'UTR as well as the 5'UTR contain one Alu 
repeat region (see Fig. 1). The cDNA encodes a protein of 486 amino acid residues (aa). The deduced molecular mass 
of the protein is 53.46 kDa. The protein contains nine zinc fingers in its carboxy-terminal half. All of these zinc 
fingers are of the Cys2His2 type (Fig. 3). They are clustered into two domains and followed by only 3 aa before the 
translational stop codon. The first five Cys2His2 zinc fingers are separated from the last four zinc fingers by 33 aa. 
Within each cluster, each zinc finger motif is only 3 to 5 aa apart from the following zinc finger. The presence of 
the 9 Cys2His2-type zinc fingers suggests that the protein binds double-stranded DNA. In addition, a nuclear 
localisation signal (PKKRKRK; for review Gorlich and Mattaj, 1996) is present in the protein at the amino-terminal 
side (residues 5-11; Fig. 1). 
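
Features such as Cys2His2 fingers and a short basic localisation signal can be located in a protein sequence by simple pattern matching. The Python sketch below illustrates the idea. The regular expression is a simplified approximation of a Cys2His2 pattern (C-x(2,4)-C-x(12)-H-x(3,5)-H) and is less restrictive than curated motif definitions; the toy sequence is hypothetical and is NOT the ANC_2H01 protein; and the 110 Da average residue mass is a common rule of thumb, used here only to show that 486 residues correspond to roughly the mass quoted above. The text does not say how the mass was actually deduced.

```python
import re

# Simplified Cys2His2 pattern (an approximation, not a curated definition).
C2H2_PATTERN = re.compile(r"C.{2,4}C.{12}H.{3,5}H")
NLS_MOTIF = "PKKRKRK"  # nuclear localisation signal quoted in the text


def find_zinc_fingers(protein: str) -> list:
    """Return 1-based (start, end) positions of non-overlapping matches."""
    return [(m.start() + 1, m.end()) for m in C2H2_PATTERN.finditer(protein)]


def find_motif(protein: str, motif: str = NLS_MOTIF):
    """Return the 1-based (start, end) of the first motif hit, or None."""
    index = protein.find(motif)
    return None if index < 0 else (index + 1, index + len(motif))


def approx_mass_kda(n_residues: int, avg_residue_da: float = 110.0) -> float:
    """Rule-of-thumb protein mass from residue count (assumed 110 Da each)."""
    return n_residues * avg_residue_da / 1000.0


# HYPOTHETICAL toy sequence: four filler residues, the NLS, a short linker
# and two made-up fingers. It only serves to exercise the functions.
TOY_PROTEIN = (
    "MAAA" + NLS_MOTIF + "GS"
    + "YKCELCGKAFSQSSNLQKHQRTHTGEKP"
    + "YKCDECGKSFTQKSNLIVHQRIHTGEKP"
)

if __name__ == "__main__":
    print("zinc fingers:", find_zinc_fingers(TOY_PROTEIN))
    print("NLS position:", find_motif(TOY_PROTEIN))
    print("approx. mass of 486 aa:", approx_mass_kda(486), "kDa")
```

In the toy sequence the signal sits at residues 5-11, mirroring the position reported for ANC_2H01, and both artificial fingers are found. Applying the same functions to the real sequence of Fig. 1 would be the natural next step.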

[0075] Further characterization of the isolated clone included a BLASTN search analysis (Altschul et al., 1990). 
The complete cDNA of ANC_2H01 did not correspond with any cloned full-size cDNA in the public-domain databases. 
However, several EST (Expressed Sequence Tag) clones with a high degree of sequence similarity (low P-score) are 
present in the databases. These ESTs are of human origin (Table 7) as well as mouse origin (Table 8). 

The ANC_2H01 protein localizes to the nucleus and translocates αN-catenin into the nucleus 

[0076] HEK293T cells were transiently transfected with the pPNhANCTN plasmid and/or the pEFBOSANC_2H01E 
plasmid by a Ca3(PO4)2 transfection procedure. The subcellular localization of αN-catenin and the ANC_2H01 
protein was analyzed by immunocytochemistry. The epitope-tagged zinc finger protein could be specifically stained 
with a monoclonal antibody against the E-tag. To detect αN-catenin, a commercial polyclonal antibody was used. 
The zinc finger protein ANC_2H01 localized to the nucleus upon transient expression in HEK293T cells (Fig. 7A). This 
nuclear staining was seen in cells transfected with the pEFBOSANC_2H01E plasmid alone as well as in cells that were 
transfected with both expression plasmids. In contrast, the αN-catenin protein was localized in the cytoplasm of 
cells transfected with plasmid pPNhANCTN only (Fig. 7B). When combined with the expression plasmid encoding the 
E-tagged ANC_2H01 protein, αN-catenin localized to the nucleus (Fig. 7C-E). Thus, translocation of αN-catenin into the 
nucleus occurs when the interacting zinc finger protein ANC_2H01 is co-expressed. 

Western Blot Analysis 

[0077] Lysates of HEK293T cells transiently transfected with pPNhANCTN and/or pEFBOSANC_2H01E were
prepared 24 h after transfection and examined for the expression of the recombinant proteins. The ANC_2H01 protein
was detected by means of the E-tag. The tagged protein has a predicted molecular mass of approximately 55 kDa and
such a band was indeed detectable (Fig. 8A). Two weaker bands of lower molecular weight were also observed on
the blot. This may be due to proteolysis of the protein at the carboxy-terminus: the E-tag, which is fused to the
amino-terminal part of the protein, is still present in protein fragments degraded at the carboxy-terminus.
[0078] The presence of αN-catenin was revealed by the use of a polyclonal antibody against αN-catenin. A single
band of approximately 100 kDa was recognized by this antibody in single and double transfected cells (Fig. 8B). The
predicted molecular mass of αN-catenin is 100,686 daltons.

[0079] The presence of other cadherins and cadherin-associated molecules was also examined in these cells.
E-cadherin was present in non-transfected cells as well as in single and double transfected cells (Fig. 8B). Using an
anti-pan-cadherin antibody, a broad band could be observed for the different transfected cells, probably covering
several cadherins. We also tried to detect αE-catenin, using an antibody against this protein. The transfected as
well as the non-transfected cells showed staining for αE-catenin (Fig. 8B). When recombinant αN-catenin was
expressed, the anti-αE-catenin antibody also recognized a larger band. This additional band was observed in the
αN-catenin single transfectant, but was weaker in the double transfectant. We concluded that the anti-αE-catenin
antibody used also recognized the recombinant αN-catenin expressed in these cells.

[0080] At a semi-quantitative level we could not detect any drastic differences in the untransfected versus the 
transfected cells upon staining for E-cadherin, pan-cadherin, or aE-catenin. 


PAC screening 

[0081] Superpools and subsequent plate-pools from the UK-HGMP PAC library (Roswell Park Cancer Institute,
Buffalo, NY) were screened by PCR with the ANC_2H01-specific primers FVR513F and FVR514R (Table 1). This resulted
in the determination of the plate number of a 384-well microtiter plate containing a positive clone. Further PCR
analysis of the pooled rows and columns pinpointed the positive clone 167024 as specific for the ANC_2H01 gene.
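The expected size of the screening product can be estimated from the primer positions listed in Table 1. The sketch below is a minimal illustration under two assumptions that are not stated in the text: that the position given for the reverse primer FVR514R is the cDNA coordinate of its 5' nucleotide on the cDNA numbering, and that no intron lies between the two primers in the genomic PAC insert (an intron would make the genomic product longer). The helper names are hypothetical.

```python
# Primer sequences and 5' positions as listed in Table 1.
FVR513F = "aacaatggtgaacatggaca"   # forward primer, 5' nucleotide at cDNA position 1,499
FVR514R = "tcactacatttatgaggcaa"   # reverse primer, 5' nucleotide at cDNA position 1,817

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (a, c, g, t only)."""
    return seq.lower().translate(_COMPLEMENT)[::-1]


def amplicon_length(fwd_5prime: int, rev_5prime: int) -> int:
    """Length of the product spanning both primers, coordinates inclusive."""
    if rev_5prime < fwd_5prime:
        raise ValueError("reverse primer must lie downstream of the forward primer")
    return rev_5prime - fwd_5prime + 1


if __name__ == "__main__":
    print("expected cDNA amplicon (bp):", amplicon_length(1499, 1817))
    print("sense-strand site of FVR514R:", reverse_complement(FVR514R))
```

The reverse complement gives the sense-strand sequence that the reverse primer anneals to, which is the string one would search for in the cDNA sequence of Fig. 1.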

Chromosomal localization of the ANC_2H01 gene by FISH analysis 

[0082] FISH was performed using the PAC clone 167024 specific for the ANC_2H01 gene. We mapped the gene
encoding the zinc finger protein ANC_2H01 to the chromosomal locus 3q27-28. To this end, we analyzed 20 metaphases.
Up to 15 of them revealed discrete hybridization signals on both chromosomes 3 at band 3q27-28 (Fig. 9). The
other metaphases showed signal on only one of the two chromosomes 3.

Two-hybrid analysis of the interaction of ANC_2H01 with αE/αN-catenin chimeras

[0083] The zinc finger protein ANC_2H01 does not interact with αE-catenin when tested in the yeast two-hybrid
system. The human αE-catenin protein shows, however, 80% homology with the closely related human αN-catenin
protein. In spite of this high similarity, their tissue distribution is clearly distinct. Both proteins also show
identity with vinculin in three regions with higher sequence conservation, the so-called vinculin homology (VH)
domains (Herrenknecht et al., 1991) (Fig. 10). In an approach to further characterize the specific interaction between
αN-catenin and ANC_2H01, we made α-catenin chimeric molecules by exchanging these VH domains between αN-catenin
and αE-catenin. We constructed six different αN/αE-chimeras fused to the GAL4 DNA-binding domain (Fig. 10). The
possible interaction of these six αN/αE-chimeras with the ANC_2H01 protein was analyzed using the two-hybrid
system. We observed no expression of either the selection gene HIS3 or the reporter gene LacZ when either the
initial ANC_2H01 two-hybrid clone or pGAD424ANC_2H01-FL was cotransformed into Y190 yeast cells with
pGBT9aECTN, pGBT9aECTNVH1N, pGBT9aNCTNVH2E or pGBT9aECTNVH3N (Fig. 11). In addition, no
interaction could be detected when the Y190 yeast strain was cotransformed with the initial ANC_2H01 two-hybrid
clone or pGAD424ANC_2H01-FL on the one hand, and the pGBT9aNCTNVH1E or pGBT9aECTNVH2N plasmid on
the other hand. However, a HIS3-positive, β-galactosidase-positive phenotype was obtained upon combination of
ANC_2H01 or pGAD424ANC_2H01-FL with either pGBT9aNCTN or pGBT9aNCTNVH3E (Fig. 11).
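The pattern in these results can be made explicit by tabulating, for each bait, the origin of its three VH domains. The sketch below is an illustration, not part of the experimental work. It assumes that each plasmid name encodes the backbone (NCTN or ECTN) followed by the single exchanged domain (for example, VH1N means VH1 taken from αN-catenin); this reading of the names is an assumption. It then checks that the reported outcomes are consistent with the rule "interaction requires both VH1 and VH2 from αN-catenin".

```python
from itertools import product

# Origin of (VH1, VH2, VH3) per bait: "N" = aN-catenin, "E" = aE-catenin,
# followed by the reported two-hybrid outcome with ANC_2H01.
# The domain assignments are inferred from the plasmid names (assumption).
BAITS = {
    "pGBT9aNCTN":     (("N", "N", "N"), True),
    "pGBT9aECTN":     (("E", "E", "E"), False),
    "pGBT9aECTNVH1N": (("N", "E", "E"), False),
    "pGBT9aNCTNVH2E": (("N", "E", "N"), False),
    "pGBT9aECTNVH3N": (("E", "E", "N"), False),
    "pGBT9aNCTNVH1E": (("E", "N", "N"), False),
    "pGBT9aECTNVH2N": (("E", "N", "E"), False),
    "pGBT9aNCTNVH3E": (("N", "N", "E"), True),
}


def predicted_interaction(domains: tuple) -> bool:
    """Candidate rule: VH1 and VH2 must both originate from aN-catenin."""
    vh1, vh2, _vh3 = domains
    return vh1 == "N" and vh2 == "N"


def rule_explains_data(baits: dict) -> bool:
    """True if the candidate rule reproduces every reported outcome."""
    return all(predicted_interaction(d) == seen for d, seen in baits.values())


if __name__ == "__main__":
    for name, (domains, seen) in BAITS.items():
        print(f"{name:16s} {''.join(domains)} interaction={seen}")
    print("rule consistent with all baits:", rule_explains_data(BAITS))
```

Under the naming assumption above, the two wild-type baits and the six chimeras together cover all eight possible combinations of VH domain origin, so the rule is tested against the complete set.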

DISCUSSION 

[0084] For a better understanding of the respective roles of the two subtypes of α-catenin protein, we set up a
two-hybrid screen with human αN-catenin as bait. The aim was the isolation of proteins that interact specifically
with αN-catenin. From a human kidney cDNA library, a single clone was isolated exhibiting the desired
HIS3-positive, β-galactosidase-positive phenotype. This two-hybrid clone was completely sequenced and the partial cDNA
was completed by 5' RACE. This resulted in a full-size cDNA of 3,013 nucleotides (nt), in correspondence with the
results of the Northern blot analysis. The 3'UTR as well as the 5'UTR contain an Alu repeat region. In addition, the
3' region contains a poly-A signal and a poly-A tail, indicating that the 3'UTR is completely isolated.
[0085] The ATG start codon occurs in an adequate, although not fully optimal, context according to Kozak (Kozak,
1996). The open reading frame is 1458 nt in length and encodes a protein of 486 aa. Structurally, the protein can be
divided into two domains. The amino-terminal part contains no obvious protein motifs, except for a stretch of basic
aa near the amino-terminus of the protein (PKKRKRK, aa 5-11). This short sequence resembles the nuclear
localization signal of the SV40 large T antigen (PKKKRKV) (Kalderon et al., 1984) and might play the same role in
ANC_2H01, i.e. targeting of the protein to the nucleus. The carboxy-terminal part, on the other hand, consists of
9 zinc fingers of the Cys2His2 type. These are clustered in two domains, separated from each other by 33 aa. Between
consecutive zinc fingers, generally five aa are present. Zinc finger motifs of the Cys2His2 type were first discovered
in the transcription factor TFIIIA and have DNA-binding properties (Hanas et al., 1983; Miller et al., 1985). Another
feature of these motifs is their autonomous folding and their stabilization by chelation of zinc between a pair
of cysteine and a pair of histidine residues. It has also been shown that in these polydactyl proteins the DNA
interaction is dominated by only a few of the many zinc fingers (Sun et al., 1996; Georgopoulos et al., 1997). One
zinc finger of the Cys2His2 type typically binds 3 bp of a double-stranded DNA sequence, and this is called a subsite.
Structural studies of such DNA-protein complexes also revealed that consecutive fingers of a polydactyl protein
interact with subsites directly adjacent to each other (Pavletich and Pabo, 1991 and 1993). It was also suggested
that not all of the zinc fingers of a polydactyl protein contribute to DNA binding and recognition and that the
remaining non-DNA-binding zinc fingers may participate in protein-protein interactions. Interactions with another
protein as well as homodimerization have been reported in this context (Sun et al., 1996; Morgan et al., 1997).
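The Cys2His2 architecture described above (two cysteines, a spacer of twelve residues, two histidines) lends itself to a simple pattern search. The sketch below is a hedged illustration: the regular expression follows a widely used consensus of the form C-x(2,4)-C-x(12)-H-x(3,5)-H with one hydrophobic position, and the test protein is a synthetic two-finger sequence built for this example; it is not the ANC_2H01 sequence, and all names are hypothetical. The footprint helper simply applies the 3 bp per finger subsite size quoted in the text.

```python
import re

# Consensus-style Cys2His2 pattern: C-x(2,4)-C-x(3)-[hydrophobic]-x(8)-H-x(3,5)-H.
C2H2_PATTERN = re.compile(r"C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H")

# Synthetic example finger and linker (illustrative only, not from ANC_2H01).
EXAMPLE_FINGER = "YKCPECGKSFSQKSNLQKHQRTH"
EXAMPLE_LINKER = "TGEKP"


def find_zinc_fingers(protein: str) -> list:
    """Return 1-based (start, end) coordinates of non-overlapping C2H2 matches."""
    return [(m.start() + 1, m.end()) for m in C2H2_PATTERN.finditer(protein.upper())]


def max_footprint_bp(n_fingers: int, bp_per_finger: int = 3) -> int:
    """Upper bound on the contiguous DNA site if every finger contacts one subsite."""
    return n_fingers * bp_per_finger


if __name__ == "__main__":
    protein = EXAMPLE_FINGER + EXAMPLE_LINKER + EXAMPLE_FINGER
    print(find_zinc_fingers(protein))
    print("footprint for 9 fingers (bp):", max_footprint_bp(9))
```

Because only a subset of fingers is thought to contact DNA, the footprint value is an upper bound and not a prediction of the actual binding site length.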
[0086] In this study, we could also show that several amino-terminal fragments of ANC_2H01 do not interact
with αN-catenin, at least not in the two-hybrid system. These results may suggest that the amino-terminal domain is
not responsible for the interaction. They can also be explained by inappropriate folding of the truncated proteins.
In the first case, we should seek an interacting domain in the two zinc finger clusters or in the spacer of 33 aa in
between. Using the two-hybrid system, the domain of αN-catenin that associates with ANC_2H01 could be delineated to
aa 193-535. In addition, an interaction of αE-catenin with the ANC_2H01 protein could not be observed in the
two-hybrid system. Further studies using αE/αN-catenin chimeras showed that for the interaction with the ANC_2H01
protein, the first two vinculin homology (VH) domains of αN-catenin are needed. No interaction was observed when only
the VH1 or the VH2 domain of αN-catenin was included in chimeric proteins with αE-catenin. In agreement with the
aforementioned data, these results strengthen the conclusion that the interaction with ANC_2H01 is αN-catenin-specific.
[0087] To localize the novel ANC_2H01 zinc finger protein and the αN-catenin/ANC_2H01 complex within cells, we
transiently transfected expression constructs encoding these two proteins into human cells (HEK293T) and visualized
the proteins by immunofluorescence. The αN-catenin was distributed throughout the cytoplasm, as reported previously
(Shibuya et al., 1996). This suggests that the αN-catenin molecules are generally dissociated from cadherins, because
cadherins were detected in the lysates of the HEK293T cells. Co-expression of the cDNA encoding the tagged zinc
finger protein resulted in the translocation of αN-catenin to the nucleus. The ANC_2H01 protein was consistently
detected in the nucleus. This observation is in line with the presence of a putative nuclear localization signal in
the ANC_2H01 protein and with the putative DNA-binding nature of the zinc fingers.

[0088] Using FISH, the ANC_2H01 gene could be mapped to the q27-28 region of human chromosome 3. A study
by James and coworkers (James et al., 1996) using YACs covering the human chromosomal region 3q27 identified ESTs
for five genes, including three members of the cystatin gene family and a gene thought to be involved in B-cell
non-Hodgkin lymphoma. The latter was confirmed by the identification of the BCL6 (B-cell non-Hodgkin lymphoma) disease
gene in this region (Chaganti et al., 1998). The p63 gene, which bears strong homology with the tumor suppressor
genes p53 and p73, is also localized in the chromosomal region 3q27-29 (Yang et al., 1998). In addition, another
member of this family, p73L, was also assigned to the 3q27-28 region (Senoo et al., 1998).

[0089] Taken together, these data show that αN-catenin regulates gene expression by a direct interaction with a
novel nuclear zinc finger protein. This transcription factor, which was cloned by us, shows no identity with any cDNA
or protein in the public databases, except for a number of unspecified EST clones. The ANC_2H01 protein induces a
translocation of αN-catenin to the nucleus by protein-protein association, an observation that has not been reported
before. The interaction of the zinc finger protein is considered to be αN-catenin-specific. It is now possible to
elucidate the target genes of the transcription factor and the function of the αN-catenin/ANC_2H01 complex in cell-cell
adhesion, cellular differentiation and other signalling pathways. Furthermore, it is possible to further delineate
the interacting domains of ANC_2H01 and αN-catenin using the two-hybrid system and other approaches; to determine
the DNA sequences to which the ANC_2H01 protein specifically binds; to identify other proteins specifically
interacting with the ANC_2H01 protein; to raise specific monoclonal and polyclonal antibodies against ANC_2H01; and,
for example, to perform drug screens for compounds that either enhance or reduce the interaction between ANC_2H01 and
αN-catenin or other proteins.

BRIEF DESCRIPTION OF THE DRAWINGS

[0090] Figure 1: Sequence of the full-size human ANC_2H01 cDNA. Dots separate blocks of 10 nucleotides.
Sequences of primers used in the 5' RACE experiment are indicated by named arrows. The predicted amino acid
sequence of the ORF is indicated in bold (one-letter code). The start codon is boxed in and is preceded by an
in-frame stop codon (shaded box). The Alu repeat regions in the 5'UTR and the 3'UTR are shaded and boxed in. Also
indicated are the nine zinc finger motifs. The amino acid sequence of the peptide used to raise polyclonal
antibodies is underlined. The sequence of the putative NLS is also boxed in.
[0091] Figure 2 : Cloning of the full-size cDNA encoding the ANC_2H01 protein. 

ORF = Open Reading Frame; UTR = Untranslated region; ATG = start codon; TGA = stop codon; AAA = poly A tail; 
aa = amino acid residues 

[0092] Figure 3: Alignment of the nine zinc finger motifs (ZF) of the ANC_2H01 protein. Numbers indicate the
number of the codon. The amino acids are in the one-letter code.

[0093] Figure 4: Analysis of the interaction of different parts of αN-catenin and full-length αE-catenin with
ANC_2H01 and different parts thereof. VH: vinculin homology domain; ND: Not Done.

[0094] Figure 5: Northern blot analysis of ANC_2H01 mRNA in three human tumor cell lines. 28S and 18S are 
ribosomal size markers. 

[0095] Figure 6: The expression of ANC_2H01 mRNA in various human tissues was studied by use of a human
RNA master blot (Clontech). Hybridization was as described in Materials and Methods.

[0096] Figure 7: Immunocytochemistry of HEK293T cells transfected with either pEFBOSANC_2H01E (A) or
pPNhANCTN (B) or with both plasmids (C-E). (A) The nuclear staining for the tagged
ANC_2H01 protein was visualized using an antibody against the E-tag. (B) The cells were stained with a polyclonal
antibody against αN-catenin. The αN-catenin protein is localized in the cytoplasm of the cells. (C) Upon
coexpression with αN-catenin, the ANC_2H01 protein remains localized in the nucleus. (D) Upon cotransfection,
αN-catenin is translocated into the nucleus. (E) Double immunostaining for the ANC_2H01 and αN-catenin proteins was
done with the antibodies applied in (A) and (B). Bar: 12 µm.

[0097] Figure 8: Western blot analysis of transfected and untransfected HEK293T cells. The cells were
transfected with either pPNhANCTN (indicated by CTN) or pEFBOSANC_2H01E (indicated by ZF) or with both
plasmids. Molecular weight markers are indicated each time at the left. (A) Blots were stained with the monoclonal
antibody against the E-tag. A band of approximately 60 kDa is detected (closed arrowhead), which is the expected
size for the tagged ANC_2H01 protein. Two weaker bands are also detected and are thought to be degradation
products (closed arrows). As a positive control, an unrelated E-tagged protein was loaded in the lane indicated with
+. (B) The expression levels of several proteins of the cadherin-catenin complex (indicated by closed arrowheads)
were also analyzed in these cells (E-cadherin, N-cadherin, pan-cadherin, αN- and αE-catenin), but no differences
could be observed at a semi-quantitative level. The band indicated with an open arrow is presumably αN-catenin, which
indicates that the anti-αE-catenin antibody used cross-reacts with αN-catenin.

[0098] Figure 9: Chromosomal localization of the human ANC_2H01 gene at band 3q27-28 by fluorescence in situ
hybridization. (A) Ideogram of chromosome 3; (B) Fluorescence signal specific for the ANC_2H01 gene; (C) DAPI
staining of the same chromosomes.
[0099] Figure 10: (A) Schematic representation of the vinculin homology domains (VH) in the α1E-catenin and
the α2N-catenin protein. α2 isoforms differ from the α1 isoforms by an alternatively used exon near the
carboxy-terminus (black box). (B) Schematic representation of the coding potential of the αE/αN-chimeric cDNAs
inserted into the pGBT9 two-hybrid vector. GAL4-DBD = GAL4 DNA-binding domain.

[0100] Figure 11: The Saccharomyces cerevisiae strain Y190 was cotransformed with: (A) plasmid
pGAD424ANC2H01-FL encoding full-length ANC_2H01 fused to the GAL4-AD, in combination with a plasmid
encoding αN-catenin, αE-catenin or αE/αN-catenin chimeras fused to the GAL4-DBD (Fig. 10), and (B) the initial
two-hybrid clone pGAD10ANC2H01 encoding part of ANC_2H01 fused to the GAL4-AD, in combination with plasmids
encoding αN-catenin, αE-catenin or αE/αN-catenin chimeras fused to the GAL4-DBD (Fig. 10). The plates contained the
medium composition indicated at the left, completed with 40 mM 3-AT and 80 mg/ml X-gal.
GAL4-AD = GAL4 transcription activation domain; GAL4-DBD = GAL4 DNA-binding domain; -LT = SD medium lacking
Leu and Trp; -LTH = SD medium lacking Leu, Trp and His. pVA3 and pTD1 are plasmids used as a positive control
in the two-hybrid system (Matchmaker™, Clontech).

TABLE 1: PRIMERS USED FOR CHARACTERIZATION OF THE HUMAN ANC_2H01 cDNA

Primer a    Position b    Application c   Sequence 5' → 3' d

FVR274F     845           SE              actgctgaagatgttccaat
FVR291F     1,215         SE              aggatccctacatttgtaaa
FVR293F     1,657         SE              ttgtgatgactgtgggaaag
FVR308R     2,766         SE              cctgggcaacatagcgagat
FVR309R     812           SE              ggactctcctcttgggtttg
FVR310R     666           SE              tctgtttctggccttgattc
FVR345F     673           CL              gcactatggccagaaacagaaatcaga
FVR346R     996           CL              ggaattcctgggcagtcacattcaaag
FVR347R     1,095         CL              ggaattccatatgctgctttaagtcag
FVR359R     940           RA              gcggttcttcatcagtttgg
FVR360R     804           RA              ctcttgggtttgctggttga
FVR421R     478           SE              aacgagaagggtgtagagtc
FVR462R     2,364         SE              tcctgtaatagtggcttcct
FVR463F     1,660         SE              tgatgactgtgggaaaggct
FVR464R     135           SE              ggaggcggaggttgtggtga
FVR465R     2,459         SE              cctccctcatcccatcttaa
FVR508F     43            SE              ccaaagcctccagcctgaga
FVR509R     [illegible]   SE              aggccgaggcgaacagatca
FVR510F     327           [illegible]     gctgggattacaggcgtgaa
FVR511F     1,343         SE              tcaagcagtgaactctacct
FVR512R     1,466         SE              aactctattaatttacatgc
FVR513F     1,499         SE, PAC         aacaatggtgaacatggaca
FVR514R     1,817         SE, PAC         tcactacatttatgaggcaa
FVR515F     1,888         SE              ccatgagacaacttgattat
FVR516F     2,098         SE              gcgtgatagtttgtagttta
FVR517F     2,298         SE              cgaccagaactaaaatgcaa
FVR518F     [illegible]   [illegible]     agatggtctcgttctgttgt
FVR519F     2,864         SE              tgcctgactcttgcccaaat
FVR660F     537           CL              atcgtcagcgacataggtcaatggaattttctctgat
FVR661R     1,880         CL              ataagaatgcggccgctgttgtctcatggactggaag
FVR662F     445           CL              ataagaatgcggccgctatgaatgagtatcctaaaa
FVR663R     728           CL              cggatacagcatagcgtagaaaaggcagtgtggtc
FVR1043F    438           CL              cgtggatccgaaaagatatgaatgagtat
FVR1044F    1,052         CL              cctctcgagcaaagttcacatttatagag
FVR1045R    1,050         CL              ggaattcgcctctataaatgtgaactt
FVR1046R    1,900         CL              ccgctcgagaagttaaagagaataatcaa
FVR1237F    447           CL              ggaattcctgaatgagtatcctaaaaaa
FVR1238R    729           CL              atgcatgctgtagaaaaggcagtgtggt
FVR1240F    526           CL              cgtcgcggccctgcagatggattcaatgga
FVR1242R    1,933         CL              tccccccggggggatgaatttattatttta
FVR1411F    447           CL              tcccccgggtatgaatgagtatcctaaaaaa
FVR1412R    1,033         CL              aaaagtcgacggccactgctattagctctc
FVR1413F    1,053         CL              ggaattcttctataaatgtgaactttgt
FVR1414R    1,900         CL              aaaagtcgacaagttaaagagaataatcaa

Legend to Table 1:

a R (Reverse) and F (Forward) refer to the antisense or sense orientation of the primers, respectively.
b The position of the most 5' nucleotide is given. Sequences are numbered according to the cDNA starting
from the putative transcription initiation site (Fig. 1).
c Application of the primers: RA, 5' RACE; PAC, PCR isolation of a PAC clone; SE, sequencing; CL, cloning of
cDNA fragments.
d Restriction sites added are underlined and in bold.
5' RACE primer set #1: FVR359R + FVR239F (Table 3)
5' RACE primer set #2: FVR360R + FVR240F (Table 3)

TABLE 2: PRIMERS USED FOR SEQUENCING THE INSERTS OF TWO-HYBRID VECTORS

Primer      Vector              Sequence 5' → 3'

FVR174F     pGAD424 / pGAD10    accactacaatggatgatgt
FVR175F     pGBT9 / pAS2        atcatcggaagagagtagta
FVR192R     pGAD424 / pGAD10    taaaagaaggcaaaacgatg
FVR217R     pGBT9 / pAS2        aaaatcataaatcataagaa



TABLE 3: PRIMERS USED FOR 5' RACE

Primer      Application   Sequence 5' → 3'

FVR239F     5' RACE       gaattcgtcgactagtacgggiigggiigggiig
FVR240F     5' RACE       gaattcgtcgactagtac
TABLE 4: PRIMERS USED FOR IN-FRAME CLONING OF αN-CATENIN INTO pGBT9 AND pAS2 VECTORS

Primer      Position   Application   Sequence 5' → 3' a

FVR137F     39         PCR, CL       acccccccrcraaacaacttcacctatcattc
FVR138R     1249       PCR, CL       gccgccgccttccttttcatttccgctctt

a: Restriction sites added are underlined and in bold.



TABLE 5: M13F/R PRIMERS USED FOR SEQUENCING THE INSERTS OF pGEM®-T CLONES

Primer      Application   Sequence 5' → 3'

FVR283      SE            cgccagggttttcccagtcacgac
FVR284      SE            tcacacaggaaacagctatgac






EP 1 054 059 A1 





TABLE 6: PRIMERS USED FOR SEQUENCING OR CLONING OF αE/αN-CATENIN CHIMERAS IN pGBT9 TWO-HYBRID VECTOR

Primer a      Application b   Sequence 5' → 3' c

FVR51F        SE              cgttccgatcctctatactgc
FVR54R        SE              atttgagtgacgaacagtgt
FVR157R       SE              ctggtcttcttggtcatttta
[illegible]   [illegible]     ttcagatggtggcagtagag
FVR738F       SE              [illegible]
FVR1157R      SE              tcagaagcaggacgagcgt
FVR1241F      PCR/CL          cgaaattccCQggggcaacttc
FVR1243R      PCR/CL          tcattaagagcatatgccagct
FVR1244F      PCR/CL          aattcccaagcgcccagctagc
FVR1245R      PCR/CL          tcctccagggaccraccaaaagc
FVR1246F      PCR/CL          aggttcCQgccgtccctgca
FVR1247R      PCR/CL          ggaatatcacrtacctgctcagc
FVR1248F      PCR/CL          gctaagcaaataccgatattcc
FVR1249R      PCR/CL          ttggctgcagatcaacggtatc
FVR1250F      PCR/CL          actggcatatgcactcaataac
FVR1251R      PCR/CL          cctggaagctgaatacctgttc
FVR1252F      PCR/CL          aacacrcrtacccagcttccagg
FVR1253R      PCR/CL          Cttggctacaggtcgactct
FVR1311F      SE              ctgtgtccccaggtcatcaa




Legend to Table 6:

a R (Reverse) and F (Forward) refer to the antisense or sense orientation of the primers, respectively.
b Application of the primers: SE, sequencing; PCR/CL, cloning of cDNA fragments obtained by PCR.
c Restriction sites added are underlined and in bold.
Table 8: Mouse EST clones with high sequence homology to ANC_2H01

EST-ID        Clone ID        NCBI-ID    Genbank-ID   P-score      Tissue

mh04b01.r1    441481 (5')     627435     AA013518                  placenta
mh04f01.r1    441529 (5')     627445     AA013538     7.00E-117    placenta
MM90D01       90D01           86891      D28708                    embryonal carcinoma cell line F9
mn16c06.r1    538090 (5')     759070     AA120290     3.60E-165    Beddington mouse embryonic region
vc56h04.s1    778615 (5')     1057643    AA413629     1.80E-17     embryo
vc71e11.s1    780044 (5')     1058954    AA414940     1.70E-70     embryo
vi05g03.r1    902932 (5')     1162950    AA518168     9.80E-135    Barstead mouse myotubes MPLRB5
vw14e12.r1    1243822 (5')    1544529    AA822672     2.40E-88     thymus
vw50a07.r1    1247220 (5')    1561565    AA839433     1.20E-126    mammary gland
vx63g07.r1    1279932 (5')    1624581    AA896550     4.70E-192    macrophage
vy81c07.r1    1312620 (5')    1662020    AA931012     3.10E-108    macrophage
mn16c06.y1    538090 (5')                AI325780     E-120        Beddington mouse embryonic region

Legend to Table 8: EST-ID: identification number of the EST (Expressed Sequence Tag); Clone ID: identification
number of the clone; NCBI-ID: identification number according to the NCBI database; Genbank-ID: identification
number according to the Genbank database; P-score: index which indicates sequence homologies according to the
BLAST algorithm (Altschul et al., 1990); Tissue: tissue used to isolate the EST.
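Since a lower P-score indicates a closer match, the mouse ESTs of Table 8 can be ranked by parsing the scores as floating-point numbers. The sketch below is illustrative only: it includes just the entries whose P-score is fully legible in Table 8 (entries with a missing score or a missing mantissa are left out), and the function name is hypothetical.

```python
# P-scores as listed in Table 8, keyed by EST-ID (legible entries only).
MOUSE_EST_P_SCORES = {
    "mh04f01.r1": "7.00E-117",
    "mn16c06.r1": "3.60E-165",
    "vc56h04.s1": "1.80E-17",
    "vc71e11.s1": "1.70E-70",
    "vi05g03.r1": "9.80E-135",
    "vw14e12.r1": "2.40E-88",
    "vw50a07.r1": "1.20E-126",
    "vx63g07.r1": "4.70E-192",
    "vy81c07.r1": "3.10E-108",
}


def rank_by_p_score(scores: dict) -> list:
    """Return EST-IDs ordered from lowest (best) to highest P-score."""
    return sorted(scores, key=lambda est_id: float(scores[est_id]))


if __name__ == "__main__":
    for est_id in rank_by_p_score(MOUSE_EST_P_SCORES):
        print(f"{est_id:12s} {MOUSE_EST_P_SCORES[est_id]}")
```

All listed values are well within the range of a double-precision float, so direct numeric comparison is adequate here; scores closer to zero than roughly 1E-308 would need to be compared by exponent instead.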
Annex to the application documents - subsequently filed sequences listing

SEQUENCE LISTING

<110> VLAAMS INTERUNIVERSITAIR INSTITUUT VOOR BIOTECHNOL 

<120> Novel cDNAs encoding catenin-binding proteins with 
function in signalling and/or gene regulation 

<130> V1/001-041/EP 

<140> EP 99201543.8
<141> 1999-05-17 

<160> 83 

<170> PatentIn Ver. 2.1

<210> 1 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR274F 
<400> 1 

actgctgaag atgttccaat 



<210> 2 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR291F 
<400> 2 

aggatcccta catttgtaaa 

<210> 3 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR293F 
<400> 3

ttgtgatgac tgtgggaaag 20

<210> 4
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer FVR308R 
<400> 4 

cctgggcaac atagcgagat 20 



<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR309R 
<400> 5 

ggactctcct cttgggtttg 20 



<210> 6 

<211> 20 

<212> DNA 

<213> Artificial Sequence 

<220>

<223> Description of Artificial Sequence: primer FVR310R 



<400> 6 

tctgtttctg gccttgattc 20 



<210> 7
<211> 27
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer FVR345F
<400> 7

gcactatggc cagaaacaga aatcaga 



<210> 8 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR346R
<400> 8 

ggaattcctg ggcagtcaca ttcaaag 

<210> 9 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR347R 
<400> 9 

ggaattccat atgctgcttt aagtcag 

<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR359R
<400> 10

gcggttcttc atcagtttgg 

<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR360R 
<400> 11

ctcttgggtt tgctggttga 

<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR421R
<400> 12

aacgagaagg gtgtagagtc 

<210> 13 
<211> 20 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR462R 
<400> 13

tcctgtaata gtggcttcct 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR463F 
<400> 14 

tgatgactgt gggaaaggct 

<210> 15 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR464R 
<400> 15

ggaggcggag gttgtggtga 

<210> 16 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer FVR465R
<400> 16 

cctccctcat cccatcttaa 

<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR508F 
<400> 17 

ccaaagcctc cagcctgaga 

<210> 18 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR509R 
<400> 18

aggccgaggc gaacagatca 

<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR510F 
<400> 19

gctgggatta caggcgtgaa 

<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer FVR511F
<400> 20 

tcaagcagtg aactctacct 

<210> 21 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR512R 
<400> 21 

aactctatta atttacatgc 

<210> 22 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR513F 
<400> 22 

aacaatggtg aacatggaca 

<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR514R 
<400> 23

tcactacatt tatgaggcaa 

<210> 24 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR515F 
<400> 24 

ccatgagaca acttgattat

<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR516F 
<400> 25 

gcgtgatagt ttgtagttta 



<210> 26 
<211> 20
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR517F
<400> 26 

cgaccagaac taaaatgcaa 

<210> 27 

<211> 20 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR518F 
<400> 27

agatggtctc gttctgttgt 

<210> 28 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR519F
<400> 28 

tgcctgactc ttgcccaaat 

<210> 29 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR660F 
<400> 29 

atcgtcagcg acataggtca atggaatttt ctctgat 

<210> 30 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR661R 
<400> 30 

ataagaatgc ggccgctgtt gtctcatgga ctggaag 

<210> 31 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR662F 
<400> 31

ataagaatgc ggccgctatg aatgagtatc ctaaaa

<210> 32 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR663R
<400> 32

cggatacagc atagcgtaga aaaggcagtg tggtc 

<210> 33 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1043F 

<400> 33 

cgtggatccg aaaagatatg aatgagtat 

<210> 34 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1044F 

<400> 34 

cctctcgagc aaagttcaca tttatagag 

<210> 35 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 
<223> Description of Artificial Sequence: primer 
FVR1045R 

<400> 35 

ggaattcgcc tctataaatg tgaactt 

<210> 36 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1046R 

<400> 36 

ccgctcgaga agttaaagag aataatcaa 

<210> 37 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1237F 

<400> 37 

gcaattcctg aatgagtatc ctaaaaaa 

<210> 38 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1238R 

<400> 38 

atgcatgctg tagaaaaggc agtgtggt 

<210> 39 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1240F 

<400> 39 

cgtcgcggcc ctgcagatgg attcaatgga 

<210> 40 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1242R 

<400> 40 

tccccccggg gggatgaatt tattatttta 

<210> 41 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1411F 

<400> 41 

tcccccgggt atgaatgagt atcctaaaaa a 

<210> 42 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1412R 

<400> 42 

aaaagtcgac ggccactgct attagctctc 

<210> 43 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1413F 

<400> 43 

ggaattcttc tataaatgtg aactttgt 

<210> 44 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1414R 

<400> 44 

aaaagtcgac aagttaaaga gaataatcaa 

<210> 45 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR174F 
<400> 45 

accactacaa tggatgatgt 

<210> 46 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR175F 
<400> 46 

atcatcggaa gagagtagta 



<210> 47 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR192R 
<400> 47 

taaaagaagg caaaacgatg 

<210> 48 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR217R 
<400> 48 

aaaatcataa atcataagaa 

<210> 49 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer KVR239F 
<220> 

<221> misc_feature 

<222> (22)..(23) 

<223> modified-base: inosine 

<220> 

<221> misc_feature 

<222> (27)..(28) 

<223> modified-base: inosine 

<220> 

<221> misc_feature 
<222> (32)..(33) 

<223> modified-base: inosine 

<400> 49 

gaattcgtcg actagtacgg gnngggnngg gnng 

<210> 50 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR240F 
<400> 50 

gaattcgtcg actagtac 

<210> 51 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR137F 
<400> 51 

accccccggg ggcaacttca cctatcattc 

<210> 52 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 138R 
<400> 52 

gccgccgcct tccttttcat ttccgctctt 

<210> 53 

<211> 24 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR283 
<400> 53 

cgccagggtt ttcccagtca cgac 

<210> 54 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR234 
<400> 54 

tcacacagga aacagctatg ac 

<210> 55 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR51F 
<400> 55 

cgttccgatc ctctatactg c 

<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR54R 
<400> 56 

atttgagtga cgaacagtgt 

<210> 57 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR157R 
<400> 57 

ctggtcttct tggtcatttt a 21 

<210> 58 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR160R 
<400> 58 

ttcagatggt ggcagtagag 20 

<210> 59 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR738F 
<400> 59 

ttggtattga ttgaagctgc 20 

<210> 60 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1157R 

<400> 60 

tcagaagcac gacgagcgt 19 

<210> 61 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1241F 

<400> 61 

cggaattccc gggggcaact tc 22 



<210> 62 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1243R 

<400> 62 

tcattaagag catatgccag ct 22 



<210> 63 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
FVR1244F 

<400> 63 

aattcccggg cgcccagcta gc 22 


<210> 64 
<211> 22 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
FVR1245R 

<400> 64 

tcctccaggg acggccgaaa gc 22 

<210> 65 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1246F 

<400> 65 

aggttccggc cgtccctgca 

<210> 66 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1247R 

<400> 66 

ggaatatcgg tacctgctca gc 

<210> 67 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1248F 

<400> 67 

gctgagcagg taccgatatt cc 

<210> 68 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1249R 
<400> 68 

ttggctgcag gtcgacggta tc 22 

<210> 69 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1250F 

<400> 69 

actggcatat gcactcaata ac 22 




<210> 70 

<211> 22 

<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
FVR1251R 

<400> 70 

cctggaagct gggtacctgt tc 22 



<210> 71 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer FVR1252F 

<400> 71 

aacaggtacc cagcttccag g 

<210> 72 

<211> 20 
<212> DNA 

<213> Artificial Sequence 

<220> 
<223> Description of Artificial Sequence: primer 
FVR1253R 

<400> 72 

cttggctgca ggtcgactct 20 

<210> 73 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
FVR1311F 

<400> 73 

ctgtgtcccc aggtcatcaa 20 



<210> 74 

<211> 3013 

<212> DNA 

<213> Homo sapiens 

<400> 74 

caacgagttg tagccgcgga gagcaggcgt cgatgctggc gcccaaagcc tccagcctga 60 
gagtcgcctc actcgtgtga cgacagagtt tcactcctgt tacccaggct ggaggacagt 120 
gatgtgatct cgggtcacca caacctccgc ctcccggatt caggcgattc tcatgcctca 180 
gcctcccgag tagctcagat tacaggcatg tgccaccacg cccggctaat tttgtatttt 240 
cagtcgagac ggggtttccc catgttggtc aggctggtct tgaattcccg acctcaggtg 300 
atctgttcgc ctcggcctcc caaagtgctg ggattacagg cgtgaaccac tgcacccggc 360 
gaggcatttt ttactgtcta cagaaactta ttgtaattca tttttcctca ctccagtagt 420 
aagaattata ccaaattgaa aagatatgaa tgagtatcct aaaaaaagaa aaaggaagac 480 
tctacaccct tctcgttatt cagattcctc tggaataagc agaattgcag atggattcaa 540 
tggaattttc tctgaccatt gttacagtgt ctgttctatg agacagccag atttaaaata 600 
ttttgacaac aaagatgatg attctgatac cgagacgtca aatgacttgc caaaatttgc 660 
agatggaatc aaggccagaa acagaaatca gaactacctg gttcccagtc ctgtacttag 720 
aattctagac cacactgcct tttctacaga aaaatctgct gatattgtaa tttgtgatga 780 
agagtgtgac tcacctgaat cagtcaacca gcaaacccaa gaggagagtc ctatagaagt 840 
tcacactgct gaagatgttc caattgctgt agaagtgcat gcgatttctg aggattatga 900 
tatagagaca gaaaacaatt cctctgagag tctccaagac caaactgatg aagaaccgcc 960 
agctaaactt tgtaaaattc ttgacaagag ccaagctttg aatgtgactg cccagcagaa 1020 
atggccttta ctgagagcta atagcagtgg cctctataaa tgtgaacttt gtgagtttaa 1080 
cagcaaatat ttttctgact taaagcagca tatgatcctg aagcataaac gtactgattc 1140 
aaatgtgtgt cgagtatgca aggaaagttt ctctaccaat atgcttctga tagaacatgc 1200 
caaactgcat gaagaggatc cctacatttg taaatactgt gattataaga cagtaatttt 1260 
tgagaacctc agccagcaca ttgcagacac ccattttagt gatcacctct attggtgtga 1320 
acagtgtgat gtacagttct cctcaagcag tgaactctac ctacatttcc aggagcacag 1380 
ctgtgatgaa cagtacttgt gtcagttctg tgaacatgaa actaatgatc cagaagactt 1440 
gcatagccat gtggtaaatg agcatgcatg taaattaata gagttaagtg ataagtataa 1500 
caatggtgaa catggacaat atagcctctt aagcaaaatt acctttgaca aatgtaaaaa 1560 
cttctttgta tgtcaagtat gtggttttcg gagtagactt cacacaaatg ttaacaggca 1620 
tgttgctatt gaacatacaa aaatttttcc tcatgtttgt gatgactgtg ggaaaggctt 1680 
ttcaagtatg ctagaatatt gcaagcattt aaattcacat ttatctgaag ggatttattt 1740 
atgtcaatat tgtgaatatt caacaggaca aattgaagat cttaaaattc atctagattt 1800 
caagcattca gctgacttgc ctcataaatg tagtgactgc ttgatgaggt ttggaaatga 1860 
aagggaatta ataagtcacc ttccagtcca tgagacaact tgattattct ctttaaccta 1920 
cagaatgtta gtttaaaata ataaattcat cctttttttg gagatgatta aatggatgat 1980 
tgtaaacaca acttatgaaa tctgccttta acaagtaact tttttaaatt ataaaatttt 2040 
attggcattg ctccattttc tgtatataaa tatatcttca atgtggtatt ttcaattgcg 2100 
tgatagtttg tagtttcaac cactcttggt gactgtcatc ctgtttcttc catattctct 2160 
gatttcatga attgaaaaga aacaaatgta ttgaagaagt gagctacagt tttccttcct 2220 
taaccatggg tgctagtaac tttttaaaac tcaagacaag attagttttt tatgtgtgaa 2280 
gtcattaaat tattacacga ccagaactaa aatgcaatat acagttaagt ccacggatac 2340 
tcccattaat gagaaataac actaggaagc cactattaca ggaagaaaag atttggtttt 2400 
catggcagtc tgttttttta aaaaaaaatt tttgagccac tatctattgt tgaatatttt 2460 
aagatgggat gagggaggaa ctaataaggg cttacacaat aaaaaataac tatatcataa 2520 
ctcattcata acttgatgtt tcattttctg ttgaggaacc ataaattcat tcacagactt 2580 
aatatttttt tcttagagat ggtctcgttc tgttgtccag gatggagtgc agtggttgat 2640 
catagctccc gggccgtagt ctcccaggct cgagcaatcc tcccacttca ccctcctgca 2700 
tagctaggac tacaggcatg tggcaccatg cccgctaagt ttttaaattt cttgtagaga 2760 
tgagatctcg ctatgttgcc caggccagtc tcaaactcct ggactcaagc aatcctccca 2820 
ccttggcctt ccaaatcact gggattatag gcatgagcca ttatgcctga ctcttgccca 2880 
aatttctgat gtcaaattgt tcattgacag aaaacccact gaagtattta aagttaggaa 2940 
gatctgggag ataggggttg ctggcatgaa aatgtataac ttacaacatt tattaataaa 3000 
atgataaatt agc 3013 

<210> 75 
<211> 24 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF1 

<400> 75 

Tyr Lys Cys Glu Leu Cys Glu Phe Asn Ser Lys Tyr Phe Ser Asp Leu 
1 5 10 15 

Lys Gln His Met Ile Leu Lys His 
20 

<210> 76 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF2 

<400> 76 

Asn Val Cys Arg Val Cys Lys Glu Ser Phe Ser Thr Asn Met Leu Leu 
1 5 10 15 

Ile Glu His Ala Lys Leu His 
20 

<210> 77 
<211> 24 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF3 



<400> 77 

Tyr Ile Cys Lys Tyr Cys Asp Tyr Lys Thr Val Ile Phe Glu Asn Leu 
1 5 10 15 

Ser Gln His Ile Ala Asp Thr His 
20 

<210> 78 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF4 

<400> 78 

Tyr Trp Cys Glu Gln Cys Asp Val Gln Phe Ser Ser Ser Ser Glu Leu 
1 5 10 15 

Tyr Leu His Phe Gln Glu His 
20 

<210> 79 
<211> 24 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF5 

<400> 79 

Tyr Leu Cys Gln Phe Cys Glu His Glu Thr Asn Asp Pro Glu Asp Leu 
1 5 10 15 

His Ser His Val Val Asn Glu His 
20 

<210> 80 
<211> 24 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF6 

<400> 80 

Phe Val Cys Gln Val Cys Gly Phe Arg Ser Arg Leu His Thr Asn Val 
1 5 10 15 

Asn Arg His Val Ala Ile Glu His 
20 

<210> 81 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF7 

<400> 81 

His Val Cys Asp Asp Cys Gly Lys Gly Phe Ser Ser Met Leu Glu Tyr 
1 5 10 15 

Cys Lys His Leu Asn Ser His 
20 



<210> 82 
<211> 24 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF8 



<400> 82 

Tyr Leu Cys Gln Tyr Cys Glu Tyr Ser Thr Gly Gln Ile Glu Asp Leu 
1 5 10 15 

Lys Ile His Leu Asp Phe Lys His 
20 



<210> 83 
<211> 23 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: zinc finger 
motif: ZF9 

<400> 83 

His Lys Cys Ser Asp Cys Leu Met Arg Phe Gly Asn Glu Arg Glu Leu 
1 5 10 15 

Ile Ser His Leu Pro Val His 
20 
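
The nine motifs given as SEQ ID NO: 75 to 83 appear to share the classical C2H2 zinc finger spacing: two residues, Cys, two residues, Cys, twelve residues, His, three or four residues, His. The following minimal sketch is an illustration only and is not part of the sequence listing. It converts two of the motifs (ZF1 and ZF2, as listed above) to one-letter code and tests them against that spacing. The function names and the regular expression are assumptions introduced here for illustration.

```python
import re

# Standard three-letter to one-letter amino acid codes.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}

# Assumed spacing x2-C-x2-C-x12-H-x(3,4)-H; a hypothetical check,
# not a definition taken from the patent text.
C2H2_PATTERN = re.compile(r"[A-Z]{2}C[A-Z]{2}C[A-Z]{12}H[A-Z]{3,4}H")


def to_one_letter(three_letter_seq):
    """Convert a space-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[res] for res in three_letter_seq.split())


def is_c2h2(three_letter_seq):
    """Return True if the whole motif matches the assumed C2H2 spacing."""
    return C2H2_PATTERN.fullmatch(to_one_letter(three_letter_seq)) is not None


# Motifs copied from SEQ ID NO: 75 (ZF1) and SEQ ID NO: 76 (ZF2).
ZF1 = ("Tyr Lys Cys Glu Leu Cys Glu Phe Asn Ser Lys Tyr Phe Ser Asp Leu "
       "Lys Gln His Met Ile Leu Lys His")
ZF2 = ("Asn Val Cys Arg Val Cys Lys Glu Ser Phe Ser Thr Asn Met Leu Leu "
       "Ile Glu His Ala Lys Leu His")

if __name__ == "__main__":
    for name, seq in (("ZF1", ZF1), ("ZF2", ZF2)):
        print(name, to_one_letter(seq), is_c2h2(seq))
```

The same check can be applied to the remaining motifs by passing their three-letter sequences to `is_c2h2`.
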
Claims 

1. An isolated and/or recombinant nucleic acid or a functional fragment, homologue or derivative thereof, 
corresponding to a catenin-binding protein with function in signal transduction or gene regulatory pathways. 

2. An isolated and/or recombinant nucleic acid or a functional fragment, homologue or derivative thereof, according 
to claim 1 wherein said molecule corresponds to a zinc finger gene with a nucleic acid sequence as shown in 
figure 1 and encoding a zinc finger protein, or fragment thereof, capable of complexing with catenin. 

3. A nucleic acid according to claim 1 or 2 wherein said protein is capable of nuclear translocation of a catenin. 

4. A nucleic acid according to claim 1, 2 or 3 wherein said catenin comprises αN-catenin. 

5. A nucleic acid according to any one of claims 1 to 4 which is a cDNA molecule. 

6. An expression vector comprising a nucleic acid according to any one of claims 1 to 5. 

7. A cell comprising a nucleic acid according to any one of claims 1 to 5 or a vector according to claim 6. 

8. A cell according to claim 7 wherein said cell is a yeast cell or a mammalian cell. 

9. A cell capable of expressing a zinc finger protein or fragment thereof, said protein capable of complexing with 
a catenin. 

10. A cell according to claim 9 wherein said catenin comprises αN-catenin. 

11. An animal comprising a cell according to any one of claims 7, 9 or 10. 

12. A zinc finger protein or derivative or fragment thereof, said protein capable of complexing with a catenin, 
preferably with αN-catenin. 

13. A protein or derivative or fragment thereof according to claim 12 encoded by a nucleic acid according to any one 
of claims 1 to 5. 

14. An antibody specifically directed against a protein or derivative or fragment thereof according to claim 13. 

15. A method for identifying a candidate drug comprising use of a cell according to any of claims 7 to 10 or an 
animal according to claim 11 or a protein according to claim 12 or 13 or an antibody according to claim 14. 

16. A method according to claim 15 wherein said drug is for use in a cancer patient or in a patient with a 
neurological disorder. 

17. A method for diagnosing, prognosing and/or treating cancer or neurological disorders, comprising use of a 
nucleic acid according to any one of claims 1 to 5, a cell according to any of claims 7 to 10 or an animal 
according to claim 11 or a protein according to claim 12 or 13 or an antibody according to claim 14. 
1 CAACGAGTTG . TAGCCGCGGA . GAGCAGGCGT . CGATGCTGGC . GCCCAAAGCC 50 

51 TCCAGCCTGA . GAGTCGCCTC . ACTCGTGTGA . CGACAGAGTT . TCACTCCTGT 100 

101 TACCCAGGCT . GGAGGACAGT . GATGTGATCT . CGGGTCACCA . CAACCTCCGC 150 

151 CTCCCGGATT . CAGGCGATTC . TCATGCCTCA . GCCTCCCGAG . TAGCTCAGAT 200 

201 TACAGGCATG . TGCCACCACG . CCCGGCTAAT . TTTGTATTTT . CAGTCGAGAC 250 

251 GGGGTTTCCC . CATGTTGGTC . AGGCTGGTCT . TGAATTCCCG . ACCTCAGGTG 300 

301 ATCTGTTCGC . CTCGGCCTCC . CAAAGTGCTG . GGATTACAGG . CGTGAACCAC 350 

351 TGCACCCGGC . GAGGCATTTT . TTACTGTCTA . CAGAAACTTA . TTGTAATTCA 400 

401 TTTTTCCTCA . CTCCAGTAGT . AAGAATTATA . CCAAATTGAA . AAGATATGAA 450 
M N 

451 TGAGTATCCT . AAAAAAAGAA . AAAGGAAGAC . TCTACACCCT . TCTCGTTATT 500 
EYP KKR KRK TLHP SRY 

putative NLS 

501 CAGATTCCTC . TGGAATAAGC . AGAATTGCAG . ATGGATTCAA . TGGAATTTTC 550 
SDSS GIS RIA DGFN GIF 

551 TCTGATCATT . GTTACAGTGT . CTGTTCTATG . AGACAGCCAG . ATTTAAAATA 600 
SDH CYSV CSM RQP DLKY 

601 TTTTGACAAC . AAAGATGATG . ATTCTGATAC . CGAGACGTCA . AATGACTTGC 650 
FDN KDD DSDT ETS NDL 

651 CAAAATTTGC . AGATGGAATC . AAGGCCAGAA . ACAGAAATCA . GAACTACCTG 700 
PKF ADGI KAR NRN QNYL 

peptide 

701 GTTCCCAGTC . CTGTACTTAG . AATTCTAGAC . CACACTGCCT . TTTCTACAGA 750 
VPS PVLR ILD HTA FSTE 

751 AAAATCTGCT . GATATTGTAA . TTTGTGATGA . AGAGTGTGAC . TCACCTGAAT 800 
KSA DIV ICDE ECD SPE 

801 CAGTCAACCA . GCAAACCCAA . GAGGAGAGTC . CTATAGAAGT . TCACACTGCT 850 
SVNQ QTQ EES PIEV HTA 
851 GAAGATGTTC . CAATTGCTGT . AGAAGTGCAT . GCGATTTCTG . AGGATTATGA 900 
EDV PIAV EVH AIS EDYD 

901 TATAGAGACA . GAAAACAATT . CCTCTGAGAG . TCTCCAAGAC . CAAACTGATG 950 
IET ENN SSES LQD QTD 

^ KVR359R 

951 AAGAACCGCC . AGCTAAACTT . TGTAAAATTC . TTGACAAGAG . CCAAGCTTTG 1000 
EEPP AKL CKI L DKS QAL 

1001 AATGTGACTG . CCCAGCAGAA . ATGGCCTTTA . CTGAGAGCTA . ATAGCAGTGG 1050 
NVT AQQK WPL LRA NSSG 

1051 CCTCTATAAA . TGTGAACTTT . GTGAGTTTAA . CAGCAAATAT . TTTTCTGACT 1100 
L|YK CELC EFN SKY FSD 

|=> Zinc finger 1 

1101 TAAAGCAGCA . TATGATCCTG . AAGCATAAAC . GTACTGATTC . AAATGTGTGT 1150 
LKQH MIL KH|K RTDS|NVC 

Zinc finger 1 <=| |=> Zinc finger 2 

1151 CGAGTATGCA . AGGAAAGTTT . CTCTACCAAT . ATGCTTCTGA . TAGAACATGC 1200 
RVC KESF STN MLLI EHA 

1201 CAAACTGCAT . GAAGAGGATC . CCTACATTTG . TAAATACTGT . GATTATAAGA 1250 
KLH|EED|PYIC KYC DYK 

Zinc finger 2 <=| |=> Zinc finger 3 

1251 CAGTAATTTT . TGAGAACCTC . AGCCAGCACA . TTGCAGACAC . CCATTTTAGT 1300 
TVIF ENL SQHI ADTH|FS 

Zinc finger 3 <=| 

1301 GATCACCTCT . ATTGGTGTGA . ACAGTGTGAT . GTACAGTTCT . CCTCAAGCAG 1350 
DHL|YWCE QCD VQF SSSS 

|=> Zinc finger 4 

1351 TGAACTCTAC . CTACATTTCC . AGGAGCACAG . CTGTGATGAA . CAGTACTTGT 1400 
ELY LHF QEH|S CDE Q|YL 

Zinc finger 4 <=| |=> Zinc finger 5 

1401 GTCAGTTCTG . TGAACATGAA . ACTAATGATC . CAGAAGACTT . GCATAGCCAT 1450 
CQFC EHE TND PEDL HSH 

1451 GTGGTAAATG . AGCATGCATG . TAAATTAATA . GAGTTAAGTG . ATAAGTATAA 1500 
VVNE H|AC KLI ELS DKYN 

Zinc finger 5 <=| 

1501 CAATGGTGAA . CATGGACAAT . ATAGCCTCTT . AAGCAAAATT . ACCTTTGACA 1550 
NGE HGQ YSLL SKI TFD 

1551 AATGTAAAAA . CTTCTTTGTA . TGTCAAGTAT . GTGGTTTTCG . GAGTAGACTT 1600 
KCKN F|FV CQV CGFR SRL 

|=> Zinc finger 6 

1601 CACACAAATG . TTAACAGGCA . TGTTGCTATT . GAACATACAA . AAATTTTTCC 1650 
HTN VNRH VAI EH|T KIFP 

Zinc finger 6 <=| 

1651 TCATGTTTGT . GATGACTGTG . GGAAAGGCTT . TTCAAGTATG . CTAGAATATT 1700 
|HVC DDC GKGF SSM LEY 

|=> Zinc finger 7 
1701 GCAAGCATTT . AAATTCACAT . TTATCTGAAG . GGATTTATTT . ATGTCAATAT 1750 
CKHL NSH|LSEG I|YL CQY 

Zinc finger 7 <=| |=> Zinc finger 8 

1751 TGTGAATATT . CAACAGGACA . AATTGAAGAT . CTTAAAATTC . ATCTAGATTT 1800 
CEYS TGQ IED LKI HLDF 

1801 CAAGCATTCA . GCTGACTTGC . CTCATAAATG . TAGTGACTGC . TTGATGAGGT 1850 
KH|S ADLP|HKC SDC LMR 

Zinc finger 8 <=| |=> Zinc finger 9 

1851 TTGGAAATGA . AAGGGAATTA . ATAAGTCACC . TTCCAGTCCA . TGAGACAACT 1900 
FGNE REL ISHL PVH|ETT 

Zinc finger 9 <=| 

1901 TGATTATTCT . CTTTAACTTA . CAGAATGTTA . GTTTAAAATA . ATAAATTCAT 1950 

1951 CCTTTTTTTG . GAGATGATTA . AATGGATGAT . TGTAAACACA . ACTTATGAAA 2000 

2001 TCTGCCTTTA . ACAAGTAACT . TTTTTAAATT . ATAAAATTTT . ATTGGCATTG 2050 

2051 CTCCATTTTC . TGTATATAAA . TATATCTTTA . ATGTGGTATT . TTCAATTGCG 2100 

2101 TGATAGTTTG . TAGTTTCAAC . CACTCTTGGT . GACTGTCATC . CTGTTTCTTC 2150 

2151 CATATTCTCT . GATTTCATGA . ATTGAAAAGA . AACAAATGTA . TTGAAGAAGT 2200 

2201 GAGCTACAGT . TTTCCTTCCT . TAACCATGGG . TGCTAGTAAC . TTTTTAAAAC 2250 

2251 TCAAGACAAG . ATTAGTTTTT . TATGTGTGAA . GTCATTAAAT . TATTACACGA 2300 

2301 CCAGAACTAA . AATGCAATAT . ACAGTTAAGT . CCACGGATAC . TCCCATTAAT 2350 

2351 GAGAAATAAC . ACTAGGAAGC . CACTATTACA . GGAAGAAAAG . ATTTGGTTTT 2400 

2401 CATGGCAGTC . TGTTTTTTTA . AAAAAAAATT . TTTGAGCCAC . TATCTATTGT 2450 

2451 TGAATATTTT . AAGATGGGAT . GAGGGAGGAA . CTAATAAGGG . CTTACACAAT 2500 

2501 AAAAAATAAC . TATATCATAA . CTCATTCATA . ACTTGATGTT . TCATTTTCTG 2550 

2551 TTGAGGAACC . ATAAATTCAT . TCACAGACTT . AATATTTTTT . TCTTAGAGAT 2600 

2601 GGTCTCGTTC . TGTTGTCCAG . GATGGAGTGC . AGTGGTTGAT . CATAGCTCCC 2650 

2651 GGGCCGTAGT . CTCCCAGGCT . CGAGCAATCC . TCCCACTTCA . CCCTCCTGCA 2700 

2701 TAGCTAGGAC . TACAGGCATG . TGGCACCATG . CCCGCTAAGT . TTTTAAATTT 2750 

2751 CTTGTAGAGA . TGAGATCTCG . CTATGTTGCC . CAGGCCAGTC . TCAAACTCCT 2800 

2801 GGACTCAAGC . AATCCTCCCA . CCTTGGCCTT . CCAAATCACT . GGGATTATAG 2850 

2851 GCATGAGCCA . TTATGCCTGA . CTCTTGCCCA . AATTTCTGAT . GTCAAATTGT 2900 

2901 TCATTGACAG . AAAACCCACT . GAAGTATTTA . AAGTTAGGAA . GATCTGGGAG 2950 

2951 ATAGGGGTTG . CTGGCATGAA . AATGTATAAC . TTACAACATT . TATTAATAAA 3000 

3001 ATGATAAATT . AGC 3013 